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Abstract 

We present a reduced-dimension multiuser detector (RD-MUD) structure that significantly decreases the 
number of required correlation branches at the receiver front-end, while still achieving performance similar to 
that of the conventional matched-filter (MF) bank. RD-MUD exploits the fact that the number of active users 
is typically small relative to the total number of users in the system and relies on ideas of analog compressed 
sensing to reduce the number of correlators. The correlating signals used by each correlator are chosen as an 
appropriate linear combination of the users' spreading waveforms, which in turn are chosen from a large class 
of spreading codes. We derive the probability-of-error when using two methods for recovery of active users 
and their transmitted symbols: the reduced-dimension decorrelating (RDD) detector, which combines subspace 
projection and thresholding to determine active users and sign detection for data recovery, and the reduced- 
dimension decision-feedback (RDDF) detector, which combines decision-feedback orthogonal matching pursuit 
for active user detection and sign detection for data recovery. We identify conditions such that error is dominated 
by active user detection. We then show that the number of correlators needed to achieve a small probability-of- 
error under these conditions is on the order of the logarithm of the number of users in the system for a given 
projection method based on random discrete Fourier transform (DFT) matrices, which is significantly lower than 
the number of correlators required by the the conventional MUD using the MF-bank. Our theoretical results 
take into consideration the effects of correlated signature waveforms as well as near-far issues. The theoretical 
performance results for both detectors are validated via numerical simulations. 

I. Introduction 

Multiuser detection (MUD) is a classical problem in multiuser communications and signal processing 
(see, e.g., [1][2][3] for classic reviews with extensive reference lists and [4][5][6] for some recent 
development in the field). In multiuser systems, the users communicate simultaneously with a given 

Yao Xie (Email: yaoxieS Stanford . edu) is with the Department of Electrical Engineering at Stanford University. 

Yonina C. Eldar (Email: yonina@ee. technion . ac . il) is with the Department of Electrical Engineering, Technion, Israel Institute 
of Technology, and was visiting the Department of Electrical Engineering, Stanford University. 

Andrea Goldsmith (Email: andrea@wsl . Stanford . edu) is with the Department of Electrical Engineering at Stanford University. 

This work is partially supported by the Interconnect Focus Center of the Semiconductor Research Corporation, BSE Transformative 
Science Grant 2010505, and a Stanford General Yao-Wu Wang Graduate Fellowship. 



2 



receiver by modulating information symbols onto their unique signature waveforms, which can be 
characterized by the chip waveform and the signature sequences (also called the spreading code) [ ]. 
The received signal consists of a noisy version of the superposition of the transmitted waveforms. The 
MUD has to detect the symbols of all users simultaneously. The channel model associated with the 
MUD may be synchronous or asynchronous. In the synchronous channel model [8], the transmission 
rate of all users is the same and their symbol epochs are perfectly aligned. This requires closed-loop 
timing control or synchronization among all transmitters. In the asynchronous channel model [9] [10], 
user time epochs need not be aligned and the transmitted waveforms arrive at the receiver with different 
time delays. Allowing users to be asynchronous simplifies system design but complicates the system 
model. The synchronous channel model can be viewed as a special case of the asynchronous channel 
with delays of all users to be the same. In this paper we will focus on the synchronous channel 
model. Part of the MUD problem is signature sequence selection, for which there has been a large 
body of work, both theoretical [1 1][12][13][14][15][16][17][18][19][20][21] and practical [7][22]. If 
we require the signature waveforms to be orthogonal, for a system with bandwidth B, the number of 
orthogonal signature waveforms available (and hence the number of users that the system can support) 
is approximately 2TB [ ]. This hard limit on system capacity can be relaxed if we allow nonorthogonal 
signature waveforms, and instead require the crosscorrelation of the selected signature waveforms to 
be sufficiently low. In this work, we do not consider optimizing signature waveforms and hence our 
results will be parameterized by the crosscorrelation properties of the signature waveforms used in our 
design. When the signature waveforms are nonorthogonal, there is mutual interference among users, 
which degrades system performance for all users. Hence, an important issue in multiuser systems with 
nonorthogonal signature waveforms is the near-far problem [ ]: a strong user with high enough power 
at the receiver may cause severe performance degradation of a weak user. One of the key challenges in 
MUD is to design a detector that works well when user signals are received at different power levels. 
Such detectors are discussed in more details below. 

While there has been a large body of work developed for the multiuser detection problem over 
the last several decades, it is not yet widely implemented in practice, largely due to its complexity 
and high-precision A/D requirements. The complexity of MUD arises both in the analog circuitry for 
decorrelation as well as in the digital signal processing for data detection of each user. We characterize 
the decorrelation complexity using the number of correlators used at the receiver front-end, and measure 
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the data detection complexity by the complexity -per-bit [c ], which is the number of real floating point 
operations required per bit decision. 

The conventional MUD detection structure consists of a matched-filter (MF) bank front-end followed 
a linear or nonlinear digital multiuser detector. The MF-bank front-end is a set of correlators, each 
correlating the received signal with the signature waveform of a different user. Hence the conventional 
MUD requires the number of correlators to be equal to the number of users. The MF-bank front-end 
obtains a set of sufficient statistics for MUD when the receiver noise is Gaussian. 

To recover user data from the MF-bank output, various digital detectors have been developed. Verdu 
in a landmark paper [ ] establishes the optimal MUD detector as the maximum likelihood sequence 
estimator (MLSE), which minimizes the probability of error for symbol detection. The upper bound on 
the probability of error for symbol detection, or the bit-error-rate (BER), is due to [ ] and the analysis 
is also given in [ ]. Although the MLSE detector can nearly eliminate the degradation in performance 
due to multiuser interference, it has two main limitations: complexity and required complete channel 
state information [ ]. The complexity-per-bit of the MLSE detector is exponential in the number of 
users when the signature waveforms are nonorthogonal. To address the complexity issue, other MUD 
suboptimal detectors have been developed. The MLSE detector is an example of a nonlinear detector that 
detects symbols of all users jointly. Another example of nonlinear detectors is the decision feedback (DF) 
detector [24][25], which is based on the idea of interference cancellation and takes various forms. One 
such form is the successive interference cancellation (SIC) detector [26] [27]. This idea of interference 
cancellation dates back to the information theoretic study of the Gaussian multiple- access channel [ ]. 
The DF detector decodes symbols iteratively and subtracts the detected symbols of strong users first to 
facilitate detection of weak users. The DF detector also requires complete channel state information but 
it has less complexity-per-bit than the MLSE detector. The number of correlators required by the DF 
detector is also equal to the number of users. The DF detector is a good compromise between complexity 
and performance among all nonlinear and linear techniques (see, e.g., [i^]). For this reason we will 
analyze the DF detector below as an example of a nonlinear detector, but in a reduced dimensional 
setting. 

Linear detection methods, which apply a linear transform to the receiver front-end output and then de- 
tect each symbol separately, have lower complexity than nonlinear methods but also worse performance. 
Linear MUD techniques include the single-user detector, the decorrelating detector and the minimum 
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mean- square-error (MMSE) detector. The single-user detector is the simplest linear detector, which 
follows the MF-bank front-end the conventional single-user detection in each branch, which detects 
symbols by comparing the front-end output with a threshold [ ]. When the signature waveforms are 
orthogonal and synchronous, the single-user detector coincides with the MLSE detector and it minimizes 
the probability-of-error. In this case, by correlating with each user's signature waveform, the MF-bank 
maximizes the output signal-to-noise ratio (SNR) for each individual user. When the signature waveforms 
are nonorthogonal, users interfere with each other, so the probability-of-error of the single-user detector 
degrades [ ]. A linear detector that eliminates user inference is the decorrelating detector [8][1U], which, 
for each user, projects the received signal onto the subspace associated with the signature waveform 
of that user. This projection amplifies noise when the signature waveforms are nonorthogonal. The 
decorrelating detector also gives the best joint estimate of symbols and amplitudes in the absence of 
knowledge of the complete channel state information [ ], and it optimizes the near-far resistance among 
linear detectors [29]. The near-far resistance is a performance measure for the degree of robustness 
against the near-far problem achieved by a multiuser detector [ ]. The MMSE detector [30] [31] [32] is 
designed to minimize the mean- square-error (MSE) between symbols and the linear transformed MF- 
bank front-end output. The MMSE detector takes into account the background noise and interference, 
and hence to some extent it mitigates the noise amplification effect of the decorrelating detector in the 
low and medium SNR regimes [ ]. A drawback of the MMSE detector is that it requires complete 
channel state information and it does not achieve the optimal near-far resistance. When the signal-to- 
noise power ratio (SNR) goes to infinity, the MMSE detector converges to the decorrelating detector 
[3]. Because of the many advantages of the decorrelating detector, it has received much attention in the 
MUD literature and is one of the most common linear detectors in MUD [ ]. Hence, in this paper, we 
will focus on the decorrelating detector as an example of a linear detector in the reduced-dimension 
setting. The decorrelating detector requires inverting the correlation matrix of the signature waveforms, 
but this computation can be done offline once the signature waveforms are selected. Its complexity- 
per-bit (other than inverting the correlation matrix) can be shown to be linear in the number of users 
[8]. 

Both linear and nonlinear MUDs have sufficiently high complexity to preclude their wide adoption in 
deployed systems. They both require the number of correlators at the receiver front-end to be equal to 
the number of users in the system. The complexity-per-bit of the nonlinear detectors are exponential in 
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the number of users in system for the MLSE detector, and less for the DF detector. The complexity-per- 
bit of the linear detectors are linear in the number of users, which is much less than that of the nonlinear 
methods. In a typical communication system, there may be thousands of users. Hence, the complexity of 
the conventional methods has been a major obstacle for implementing the MF-bank based conventional 
MUD detectors. Our methods reduce the front-end complexity far below that of the nonlinear and linear 
techniques in large systems, and their complexity-per-bit are comparable to those of the conventional 
nonlinear and linear techniques. For example, when the random partial discrete Fourier transform (DFT) 
matrix is used to construct the correlating signals, the number of correlators used by our methods is 
proportional to the logarithm of the number of users and proportional to the square of the number of 
active users, and the complexity-per-bit of our method is comparable to the corresponding conventional 
MUD detectors. 

In this work, we develop a low complexity front-end for MUD along with corresponding digital 
detectors. We call this structure a reduced-dimension multiuser detector (RD-MUD). The RD-MUD 
reduces the number of correlators while still achieving performance similar to that of conventional 
MUDs that are based on the MF-bank front-end. We reduce complexity by exploiting the fact that at 
any given time the number of active users, K, is typically much smaller than the total number of users, 
N. This analog signal sparsity allows us to use techniques from analog compressed sensing, which 
exploits sparsity in analog signals (see, e.g., [33][34][35][36][37][38][39] and [40] for more details on 
these techniques). Our RD-MUD has a front-end that correlates the received signal with M correlating 
signals, where M is much smaller than N. The correlating signals are formed as linear combinations 
of the signature waveforms via a (possibly complex) coefficient matrix A as is done in the analog 
compressed sensing literature for sparse signal recovery. Our choice of A will be shown to be crucial 
for performance. The output of the RD-MUD front-end can thus be viewed as a projection of the MF- 
bank output onto a lower dimensional detection subspace. To recover information from this detection 
subspace, we process the front-end output using algorithms that combine ideas from compressed sensing 
and MF-bank based conventional MUD. We study two such detectors in detail: the reduced-dimension 
decorrelating (RDD) detector, a linear detector that combines subspace projection and thresholding 
to determine active users with a sign detector for data recovery [41] [42], and the reduced-dimension 
decision-feedback (RDDF) detector, a nonlinear detector that combines decision-feedback orthogonal 
matching pursuit (DF-OMP) [43] [44] for active user detection with sign detection for data recovery in 
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an iterative manner. We present theoretical probability-of-error performance guarantees for these two 
specific detectors in terms of the coherence of the matrix A in a non- asymptotic regime with a fixed 
number of users and active users. Based on these results, we develop a lower bound on the number 
of correlators M needed to attain a certain probability-of-error performance. For example, if A is a 
random partial discrete Fourier transform matrix, the M required by these two specific detectors is on 
the order of log A^. We validate these theoretical performance results via numerical examples. 

Previous work on MUD based on active user detection falls into two categories: conventional methods 
and compressed sensing methods. Specifically, to detect active users in the system, conventional MUD 
approaches use techniques such as the multiple signal classification algorithm (MUSIC) [45], quickest 
change detection [46], random set theory, sphere detection and Bayesian filter [47] [48] [49] [50]. In 
particular, [-ryi] focuses on the on-online detection of the entrance of a new user into the system, and 
[45] [4 7] [50] consider the case when the number of active users is unknown. There has also been work 
about detecting a subset of active users of interest [51], which belongs to a more general technique 
called group detection [ ]. The group detection technique partitions active users into groups and jointly 
detects users within a group using the generalized likelihood ratio test. A compressed sensing approach 
has also been applied for active user detection. However, in contrast to our approach, which processes 
analog signals, most existing work on exploiting compressed sensing ideas for signal detection based on 
the original compressed sensing results [53] [54] assume discrete signals. In particular, most prior work 
on MUD exploiting user sparsity applies compressed sensing techniques on discrete signals via matrix 
multiplication [55][56][57][58][59][60], whereas in our work the compressed sensing techniques are 
incorporated into the RD-MUD analog front-end. Furthermore, RD-MUD aims at detecting active users 
as well as their transmitted symbols, whereas prior work [61] [62] [55] [56] [57] aims at detecting only 
the active users and hence the problem is equivalent to support recovery. These prior works establish 
conditions on the number of correlators M required to achieve a zero probability-of-error of active 
user detection when the number of users tends to infinity. While providing important insights into 
complexity reduction in large systems, they did not answer questions for practical system design with 
a finite number of users, such as how many correlators should be used to achieve a target probability- 
of-error. There is another branch of compressed detection work that focuses on detecting the presence 
of a discrete signal that is sparse in time [63] [59] [64]. This work is not relevant to our problem since 
the MUD signal we consider is sparse in the number of users. 
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Our RD-MUD consists of two stages: active user detection and data detection of active users. The 
first stage is closely related to [ ]. However, our problem differs in that the probability-of-error must 
consider errors in both stages. We derive conditions under which the probability-of-error is dominated 
by errors in the first stage. Also, the decision-feedback (DF) detector we consider is different in its active 
user detection because it subtracts out detected data symbols whose values are from a finite alphabet 
(rather than subtracting out the estimated data symbols whose values are real) from the second stage, 
which makes detection of the remaining active users easier. 

The rest of the paper is organized as follows. Section II discusses the system model and reviews 
conventional detectors using the MF-bank front-end. Section III presents the RD-MUD front-end and 
detectors. Section IV contains the theoretical performance guarantee of two RD-MUD detectors: the 
RDD and RDDF detectors. Section V validates the theoretical results through numerical examples, and 
finally Section VI concludes the paper. 

II. System Model 

A. Notation 

The notation we use is as follows. Vectors and matrices are denoted by boldface lower-case and 
boldface upper-case letters, respectively. The real and complex numbers are denoted by M and C, 
respectively. The real part of a scalar x is represented as 3fJ[x]. The conjugate of a complex number x is 
denoted by x*. The set of indices of the nonzero entries of a vector x is called the support of x. Given 
an index set X, Xj denotes the submatrix formed by the columns of a matrix X indexed by X, and 
xi denotes the subvector formed by the entries indexed by X. The entry of a matrix X at the nth row 
and mth column is indicated by [X]„m. The identity matrix is denoted by I. The transpose, conjugate 
transpose, and inverse of a matrix X are denoted by X^, X^, and X^^, respectively. The inner product 
between two vectors x, y G M^^^ with entries s„ and respectively, is defined as: x^y = J2n=i ^nVn- 
The I2 norm of a vector x is denoted by ||x|| = (x^x)^/^. The minimum and maximum eigenvalues 
of a positive-semidefinite matrix X are denoted by Amin(X) and Ainax(X), respectively. The trace of a 
square matrix X is denoted as tr(X). The spectral norm of X is denoted by p(X) = [Amax(X^X)]i/2_ 
The function 5nm is defined such that 5nm = 1 only when n = m and otherwise is equal to 0. The sign 
function is defined as sgii(x) = 1, if x > 0, sgii(x) = —1, if x < 0, and otherwise is equal to 0. The 
expectation of a random variable x is denoted as E{x} and the probability of an event A is denoted 
as P{A). The union, intersection, and difference of two sets {A} and {B} are denoted by {A} U {-B}, 
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{yl}n{i?}, and {A}/{B}, respectively. The complement of a set {A} is denoted as {Ay. The notation 
A c B means that set A is a subset of set B. 

B. System Model 

Consider a multiuser system with users. Each user is assigned a unique signature waveform from 
a set iS = : [0, T] — )• M, 1 < n < A^}. The signature waveforms are assumed given and known, 

and possess certain properties discussed in more detail below. Each user modulates its data signal by its 
signature waveform to transmit its data symbols. The symbols carry information and are chosen from 
a constellation depending on the specific modulation scheme. The duration of the data symbol T is 
referred to as the symbol time, which is also equal to the inverse of the data rate for binary modulation. 

Define the inner product (or crosscorrelation) between two real analog signals x{t) and y{t) in L2 

as 

1 r 

{x{t),y{t)) = -j^ x{t)y{t)dt (1) 
over the symbol time T. We also define the L2 norm of a real analog signal x{t) as 

||a;(t)|| = (x(t),x(t))V2. (2) 

Two signals are orthogonal if their crosscorrelation is zero. We assume that the signature waveforms 
are linearly independent. That is, any linear combinations of different signature waveforms cannot be 
another signature waveform (or its multiples) in order to avoid cancellation of each other's transmission. 
The crosscorrelations of the signature waveforms are characterized by the Gram matrix G, defined as 

[G]„; ^ (s„(t), Slit)), l<n<N, 1<1<N. (3) 

For convenience, we assume that s„(t) has unit energy: ||s„(t)|| = 1 for all n so that [G]„„ = 1. Due 
to our assumption of linear independence of the signature waveforms, G is invertible. The signature 
waveforms typically have low crosscorrelations, so we also assume that the magnitudes of the off- 
diagonal elements of G are much smaller than 1. 

We consider a synchronous MUD model with Binary Phase Shift Keying (BPSK) modulation [ ]. 
There are K active users out of N possible users transmitting to the receiver. The set X contains indices 
of all active users, and its complement set I'^ contains indices of all non-active users. The active users 
modulate their signature waveforms using BPSK modulation with the symbol of the user n denoted by 
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bn G {1,-1}, for n G X. The nth user transmits its modulated signal at power P„ through a wireless 
channel with channel amplitude gn- We assume that the channel amplitude (?„ can be estimated and 
is known to the receiver (CSIR). Define the gain r„ = \fRngn where we assume r„ is also known at 
the receiver. For simplicity, we assume that gn is real and, hence, r„ is real and can be negative. The 
nonactive user can be viewed as transmitting with power P„ = 0, or equivalently transmitting zeros: 
hn = 0, for n E T^. 

The received signal y{t) is a superposition of the transmitted signals from the active users, plus white 
Gaussian noise w(t) with zero-mean and variance a^: 



with hn G {1, —1}, n eX, and 6„ = 0, n G X^. In the presence of delays of the asynchronous channel 
model, the ideas can be combined with the methods developed in [39] for time-delay recovery from 
low-rate samples. 

The goal of multiuser detection (MUD) is to detect the set of active users, i.e. users with indices in 
X, and their transmitted symbols {6„ : n G X}. In practice the number of active users K is typically 
much smaller than the total number of users A^, which is a form of user sparsity. As we will show, this 
user sparsity enables us to reduce the number of correlators at the front-end and still be able to achieve 
performance similar to that of a conventional MUD using a bank of MFs. To simplify the detection 
algorithm, we assume that K is known. To consider joint estimation of K as well as the active users 
and their symbols greatly complicates analysis for the probability-of-error, since in that case the error 
will come from three sources: estimating the wrong number of active users, estimating the wrong set of 
active users, and estimating the wrong symbols. By assuming K is known, we will consider the latter 
two sources of errors in our analysis. The problem of estimating K can be treated using techniques 
such as these in [ ] and [47]. 

In the following subsection, we present the structure of conventional MUD detectors using the MF- 
bank front-end. In Section III we introduce the new reduced-dimension MUD (RD-MUD) front-end and 
detectors. 



N 
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Fig. 1: Front-end of (a) conventional MUD using MF-bank, and (b) RD-MUD. 
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Fig. 2: The diagram of (a) linear detector, and (b) nonlinear detector. 



C. Conventional MUD 

A conventional MUD detector has a front-end consisting of a bank of matched filters followed by a 
digital detector. In the following we review the MF-bank front-end and various digital detectors. 

1) MF-bank front-end: For single-user systems, the matched filter (MF) passes the signal s{t) through 
a filter with impulse response s*(T — t) or, equivalently, multiplies the received signal s{t) with itself 
and integrates over a symbol time: this maximizes the output SNR of the decision statistic among the 
class of all linear detectors [:)]. The MF-bank is an extension of the MF for multiple users, and it has N 
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MFs in parallel: the nth branch correlates the received signal with the corresponding signature waveform 
Sn{t), as illustrated in Fig. la. As stated earlier, the output of the MF-bank is a set of sufficient statistics 
for MUD when the gains r„ are known [ ]. Hence, no information is left in the MF-bank front end of 
the MUD receiver. 

Using the model for the received signal (4), the output of the nth correlator of the MF-bank can be 
written as 

Zn = {y(t), Sn{t)) = Tnhn + ^[G]„in6/ + U„, 1 < n < A^. (5) 

The output noise ti„ = {w{t), Snit)) is a Gaussian random variable with zero mean and covariance 
E{unit)umit)} = (J^[G]nm (for derivation see Section 2.9 in [ ]). Let z = [zi, ■ ■ ■ , znV , R G R^""^ 
be a diagonal matrix with [R]„„ = r^, b = [61, ■ ■ ■ , Bn]^ and u = [ui, ■ ■ ■ , un]~^ . We can express the 
output of the MF-bank (5) in a vector form as: 

z = GRb + u, (6) 

where u is a Gaussian random vector with zero mean and covariance E{uu^} = a^G. 

2) MF-bank detection: Conventional MUD detectors based on using the MF-bank output can be 
classified into two categories: linear and nonlinear detectors. The diagrams of these two detectors are 
illustrated in Fig. 2. In the literature, typically the basic synchronous MUD model assumes all users 
are active, i.e. 6„ G {1,-1}, and hence the goal of the MUD detectors is to detect all user symbols. 

The linear detector applies a linear transform to the MF-bank output (illustrated in Fig. 2a): 

Tz = TGRb + Tu, (7) 

and detects symbol for each user separately using a sign detector: 

K = sgn(r„[Tz]„), l<n<N. (8) 

Several commonly used linear detectors are the single-user detector, the decorrelating detector and 
the minimum- mean- square-error (MMSE) detector. The single-user detector [ ] is equivalent to having 
T = I in (7), and detecting symbols as 6„ = sgn(r„2;„). The decorrelating detector [b] is motivated by 
the fact that when the signature waveforms are nonorthogonal, even in the absence of noise, the single- 
user detector (8) will not lead to good detection, when there is non-negligible interference between 
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users (captured mathematically in (9) by the second term in parenthesis), since generally 



bn = sgn{rnZn) = sgn I rlbn + ^ [G]nirnribi 1 7^ sgn(r^6„) = b, 

\ l^n,l=l J 




(9) 



The decorrelating detector addresses this problem by applying a linear transform T = in (7). The 
decorrelating detector can recover symbols perfectly in the absence of noise; however, it also amplifies 
noise when G 7^ I and requires G to be invertible. The minimum mean square error (MMSE) detector is 
based on the criterion to minimize the mean square error (MSE) between the linear transform of the MF- 
bank output and symbols. Based on this principle, MMSE uses a linear transform T = (G + a^R^^)^^ 



The nonlinear detectors, on the other hand, detect symbols jointly and (or) iteratively as illustrated in 
Fig. 2b. The nonlinear detectors include the maximum likelihood sequence estimator (MLSE) and the 
successive interference cancellation (SIC) detector [3]. The MLSE achieves the minimum probability- 
of-error by solving the following optimization problem to detect symbols jointly: 



If the signature waveforms are orthogonal, G is a diagonal matrix and the optimization problem of 
(10) decouples into single user problems, and the optimal solution is the sign detector (8) (with 
T = I). However, when the signature waveforms are nonorthogonal this optimization problem (10) 
is exponentially complex in the number of users [ ]. It can be solved by exhaustive search which is 
computationally very expensive. The SIC detector first finds the active user with the largest gain, detects 
its symbol, subtracts its effect from the received signal, and iterates the above process using the residual 
signal. After K iterations, the SIC detector determines all active users. As we will show later, the ideas 
of SIC and the orthogonal matching pursuit (OMP) in compressed sensing are similar. 



The RD-MUD front-end, illustrated in Fig. lb, correlates the received signal y{t) with a set of 
correlating signals hmit), m = 1, ■ ■ ■ M, where M is typically much smaller than A^. This is in contrast 
to the conventional matched filter (MF) bank, which correlates the received signal with the full set of 

signature waveforms [ ]. The front-end output is processed by either a linear or nonlinear detector 
to detect active users and their symbols, as shown in Fig. 2 for both linear and nonlinear detectors. 



in (7) [3]. 



max 2y^Rb - b'^RGRb. 



(10) 



III. Reduced-Dimension MUD (RD-MUD) 
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A. RD-MUD: Front-End 

The design of the correlating signals hmit) is the key for RD-MUD to reduce the number of 
correlators. To construct these signals, we rely on biorthogonal waveforms. The related notion of the 
biorthogonal function has been used in analog compressed sensing to construct multichannel filters to 
sample the analog signal [35]. The biorthogonal signals with respect to {sn{t)} are defined as a linear 
combination of all signature waveforms using a weighting coefficient matrix G^^: 

N 

Sn{t) = J2^G-'Usi{t), l<n<N, (11) 
1=1 

and have the property that = 5nm, for all n, m, i.e. the set of signals {■Sm(t)} are 

biorthogonal to the signature waveforms {s„(t)}. This is because 

IN \ N N 

{Sn{t), sUt)) = ( Sn{t), Y,[^~^UMt) \ = Y,[G'']ml{Sn{t), Si{t)) = [G"^]^; [G]z„ = S^m- (12) 

\ 1=1 I 1=1 1=1 

Since we assume that the signature waveforms are linearly independent in Section II-B, G is invertible. 
Note that when {s„(t)} are orthogonal, G = I and Sn{t) = s„(t). 

The RD-MUD front-end uses as its correlating signals {hmit)} which are linear combinations of the 
biorthogonal waveforms. The linear combination uses (possibly complex) weighting coefficients amn 
that we choose: 

N 

hmit) = '^amnSnit), 1 < m < M. (13) 

n=l 

The performance of RD-MUD depends on these coefficients {amn}- Define a coefficient matrix A G 
^MxN ^-^^Yi [A]mn — o-mn ^ud dcuotc the nth column of A as a„ = [ai„, ■ ■ ■ , aMnV , n = 1, ■ ■ ■ , A^. 
We normalize the columns of A so that ||a„|| = 1. The design of the correlating signals is equivalent 
to the design of the coefficient matrix A for a given {s„(t)}. We will use coherence as a measure of 
the quality of A, which is defined as [67] [65]: 

/i = max la^ail . (14) 

As we will show later, it is desirable that the columns of A have small correlation such that fi is small. 
This requirement for small fi also reflects a tradeoff in choosing how many correlators M to use in the 
RD-MUD front-end. We will show later that with more correlators, the coherence of A can be lower 
and the performance of RD-MUD can be better. 
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We now derive the output of the RD-MUD front-end. From the choice of the correlating signals (13) 
and the receive signal model (4), the output of the mth correlator is given by: 

N N \ / N 



Vm = {Ki{t),y{t)) = \^aninSn{t),^rihiSi{t)\ ^ (15) 

\n=l 1=1 I \n=l I 

N N 

= '^ribi'^arnn{Sn{t),Si{t)) +Wm (16) 
1=1 n=l 
N 

= ^ amiribi + Wm, (17) 
1=1 

where we have defined Wm = (^J2n=i (^mnSn(t),w{t)'^, and used the property that (s„(t), Sm{t)) = 6nm- 
We define the output noise 

N 



n=l 



Xl«-n(s„(t),u;(t)), l<m<M. (18) 

This is a Gaussian random variable with zero-mean, variance cr^ = (j^[AG^^A^]mm, and covariance 
Pnm — ^{wnWm} = a'^[AG^^ A^]nm (for derivation see Appendix A). Denoting y = [yi, - ■ ■ jVmV ^^'^ 
w = [wi, - ■ ■ ,wm]^, we can express the RD-MUD output (17) in vector form as 

y = ARb + w, (19) 

where w is a Gaussian random vector with zero mean and covariance a^AG'^A^. The vector y can 
be viewed as a linear projection of the MF-bank front-end output onto a lower dimensional subspace 
which we call the detection subspace. Since there are at most K active users, b has at most K non-zero 
entries. The idea of RD-MUD is that when the original signal vector b is sparse, with proper choice 
of the matrix A, the detection performance for b based on y of (19) in the detection subspace can be 
similar to the performance based on z (6), the output of the MF-bank front-end. 

B. RD-MUD: Detectors 

We now discuss how to recover b from the RD-MUD front-end output y of (19) using digital detectors. 
The model for the output (19) of the RD-MUD front-end has a similar form to the observation model 
in the compressed sensing literature [56] [65], except that the noise in the RD-MUD front-end output is 
colored due to matched filtering at the front-end. Hence, to recover b, we can adopt the ideas developed 
in the context of compressed sensing, and combine them with techniques in MF-bank detection. 
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The linear detector for RD-MUD first estimates active users X using support recovery techniques 
from compressed sensing [68] [56] [69]. These support recovery techniques include thresholding [ ], 
which we will describe in more detail later and use for active user detection in the RDD detector 
and other linear detectors, and orthogonal matching pursuit (OMP) [43] [44], which we extend to the 
decision-feedback OMP and use for joint active user and symbol detection in the RDDF detector. 

Once the active users are estimated, their symbols bj can be detected. Based on the estimated index 
set of active users X (which may be different from X), we can write the RD-MUD front-end output 
model (19) as 

y = AfSijbf + w. (20) 

The symbols bj can be detected from (20) by applying a linear transform to the front-end output and 
detecting symbols separately. The nonlinear detector for RD-MUD detects active users and their symbols 
jointly (and/or iteratively). 

We will focus on recovery based on two algorithms: (1) the reduced-dimension decorrelating (RDD) 
detector, a linear detector that uses subspace projection along with thresholding [ ] to determine 
active users and sign detection for data recovery; (2) the reduced-dimension decision feedback (RDDF) 
detector, a nonlinear detector that combines decision-feedback orthogonal matching pursuit (DF-OMP) 
for active user detection and sign detection for data recovery. DF-OMP differs from the conventional 
OMP [43] [44] in that in each iteration, the binary-valued detected symbols, rather than the real- valued 
estimates, are subtracted from the received signal to form the residual used by the next iteration. The 
residual consists of the remaining undetected active users. By subtracting interference from the strongest 
active user we make it easier to detect the remaining active users. DF-OMP can also be viewed as a 
successive interference cancellation (SIC) detector [26] [27] in the detection subspace: DF-OMP detects 
each user by computing the inner product of the received signal vector and the signature waveform vector 
in the projection subspace and subtracts the effect of each user using the data model (19), whereas the 
SIC detector does this using a difference data model (6). These two algorithms are summarized in Table 
I. 

1) Reduced-dimension decorrelating (RDD) detector: The RDD detector works as follows. As per 
(19), the front-end of the RD-MUD projects the received signal y{t) onto the detection subspace as a 
vector y. By considering the RD-MUD front-end output when the input signal is s„(t), we can show 
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that the column a„ of A corresponds to the nth signature waveform vector in the detection subspace: 

N 

{hm{t),Sn{t)) = ^aml{si{t),Sn{t)) = ttmn, l<m<M. (21) 

1=1 

Considering the detection method of the conventional MUD, a natural strategy for RD-MUD is to 
match the received signal vector y and the nth signature waveform vector in the detection subspace by 
computing their inner product, which is given by a^y, n = 1, ■ ■ ■ , N. To detect active users, we can 
rank the magnitudes of these inner products and detect the index of the K largest as active users: 

X={n: if |3?[af y]| is among the K largest of |3?[af y]|, n = l,--- ,N}. (22) 

This method (22) has also been used for sparsity pattern recovery in the compressed sensing literature 
(e.g. [56]). To detect their symbols, we use sign detection: 

.^_jsg„M[<yl). ^^^^ 
[ 0, n^i. 

In detecting active users (22) and their symbols (23), we take the real parts of the inner products 
because the imaginary part of a^y contains only noise and interference. To see this, expand 

afy = rnbn+ Yl n&z(afaO+a^w, nel, (24) 

any = 5Zn6Ka>)+a>, n^I. (25) 

lei 

Recall that symbols 6„ and gains r„ are real and only A can be complex. Hence the term r„6„ in (24), 
which contains the transmitted symbol, is real, and the rest of the terms in (24) and (25), which contain 
noise and interference, are complex. For real A, (22) and (23) are equivalent to decisions based on the 
magnitudes of the inner products. 

The RDD detector computes the inner products of the received signal vector and the signature 
waveform vector a^y, n = 1, ■ ■ ■ , A^, which is equivalent to computing A^y. This requires M N floating 
point operations when A is real (or 2MN operations when A is complex) for detection of log2 3 bits 
(since equivalently we are detecting 6„ G { — 1,0,1}). Hence the complexity -per-bit of the RDD detector 
is proportional to M. For other RD-MUD linear detectors, following the inner product computation, 
linear processing can be done by multiplying an x matrix that incurs A^^ operations. Hence the 
complexity-per-bit of other RD-MUD linear detectors is proportional to M + A^. Since M < A^ in 
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TABLE I: RD-MUD Detection 



RDD detector 


RDDF detector 


Detect active users: 

find X that contains indices of the K largest 

l^[a^y]|- 

Detect symbols: 
in = sgn(r„3fJ[a^y]) for n e X, and 
6„ = for n ^ X. 


Initialize: X is empty, b*^°^ = 0, v*^°^ = y. 
Iterate K times: k — 1,--- ,K. 
Detect active user: 

Uk = argmaxn ^fjfa^v^*^"-^^] 
Detect symbol: 

ferf-' = sgn(r„^3?[af^v(''~^^]), for n = Uk, and 

bn^ = Un'^^ for n ^ Uk. 
Update residual: 

±(k) ^ J{k-1) y 1^ 

yC^) =y- ARb^'^). 



RD-MUD, the complexity for data detection of the RDD detector and other RD-MUD linear detectors 
is on the same order as that of the conventional linear MUD detector. But the RDD detector and other 
Unear RD-MUD detectors require much lower decorrelation complexity in the analog front-end than the 
conventional linear detector. 

2) Reduced-dimension decision feedback (RDDF) detector: The RDDF detector determines active 
users and their corresponding symbols iteratively. It starts with an empty set as the initial estimate for 
the set of active user zeros as the estimated symbol vector b^°^ = 0, and the front-end output as 
the residual vector v(°^ = y. Subsequently, in each iteration k — 1, • • • , -fC, the algorithm selects the 
column a„ that is most highly correlated with the residual v'^'^^^^ as the detected active user in the kih 
iteration, with the active user index: 

Uk = arg max |3fi[a^v('=-^)] I . (26) 

This index is then added to the active user set X^^^ = X(^~^) u {uk}. The symbol for user Uk is detected 
with other detected symbols staying the same: 

hik) _ J sgn(3ftKaf^v(^-i)]), n = n^; 

[ On , n^Uk- 

Then the residual vector is updated through 

yW^y-ARb^'^^ (28) 
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The residual vector represents the part of b that has yet to be detected by the algorithm along with 
noise. The iteration repeats K times (as we will show, with high probability DF-OMP never detects the 
same active user twice), and finally the active user set is given by X = I^^'> with the symbol vector 
bn = bi''\n = !,■■■ ,N. 

The RDDF detector computes the inner product between the received signal vector and the signature 
waveform vector a^v^*^-', n = 1, ■ ■ ■ , A^, for k = 1, - ■ ■ , K. This requires KMN floating point operations 
when A is real {2KMN operations when A is complex) for detection of A^log2 3 bits. Hence the 
complexity-per-bit is proportional to KM. Since M < N, this implies that the complexity for data 
detection of the RDDF detector is on the same order as that of the conventional DF detector (the 
complexity-per-bit of the DF detector is proportional to KN). But the RDDF detector requires much 
lower decorrelation complexity in the analog front-end than the conventional DF detector. 

3) Noise whitening transform: The noise in the RD-MUD output (19) is in general colored due to 
the matched filtering at the front-end. We can whiten the noise at the front-end output by applying a 
linear transform before detecting active users and symbols, as illustrated in Fig. 3. The linear transform 
to whiten noise in the RD-MUD output is given by (AG^^A^)^^/^, and the whitened output is given 
by: 

^ (AG-iA^)-i/V = (AG-iA^)-i/2ARb + Wo, (29) 

where Wq is a Gaussian random vector with zero mean and covariance matrix (X^I. If we define a new 
measurement matrix 

A^ ^ (AG-^A^)-i/'A, (30) 

then the RDD and RDDF detectors work with the whitened output (29) if we replace A with A^^ and y 
with in (22), (23), (26) and (27). While whitening the noise in RD-MUD front-end output, the noise 
whitening transform also distorts the signal component. As we will demonstrate via numerical examples 
in Section V-B2, the benefits of noise whitening exceed the impact of the corresponding signal detection 
only when the signature waveforms {s„(t)} are highly correlated. Since this is typically not the case 
in multiuser systems due to the interference between users that coexists [^], our analysis will focus on 
detectors without noise whitening, and the benefits of omitting the noise whitening demonstrated in our 
numerical results. 
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Fig. 3: The diagram of a MUD detector with the noise whitening transform. 



4) Other RD-MUD linear detectors: As we discussed in Section I, various linear detectors have been 
developed for the MF-bank front-end output in the conventional MUD setting. In this section we explore 
some alternate linear detectors other than the decorrelating detector in the reduced-dimension setting. 
In particular, we will examine the reduced-dimension MMSE (RD-MMSE) and the reduced-dimension 
least-squares (RD-LS) detectors. 

Reduced-dimension MMSE (RD-MMSE) detector: 

Similar to the MMSE detector of the conventional MUD, a linear detector based on the MMSE 
criterion can be derived for (20) based on the active user set estimate X given by (22). In other words, 
we want to find a linear transform M that minimizes IE{||bj — Myp}, where the expectation is with 
respect to the vector of transmitted symbols hj- and the noise vector w. Following the approach for 
deriving the conventional MMSE detector [ ], we assume that hj- has a distribution that is uncorrelated 
with the noise w and Ejbjbl'^} = I. Adapting the techniques for deriving the conventional MMSE 
detector [3] and taking into account the noise co variance matrix AG^^A^ of the reduced-dimension 
model (20), we obtain the linear transform for the reduced-dimension MMSE (RD-MMSE) detector as: 

M = %Af (AjR|Af + a^AG^^A^)-^ (31) 

The derivations are given Appendix B. Accordingly, the RD-MMSE detector determines symbols as: 

_ J sgn([%Af(A^RjAf + a2AG-^A^)-iy]„), n G X; 
[ 0, n^X. 

In summary, the RD-MMSE detector determines active users first through the support recovery method 
of (22) and then uses (32) to detect symbols. 

Reduced-dimension least squares (RD-LS) detector: 

In the reduced-dimension model (20), based on the active user set estimate X given by (22), the matrix 
A^-Rj introduces interference when we detect hj-. From the view of the system of linear equations, (20) 
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is an over-determined system with more equations than unknowns since in general we require the number 
of branches to be greater than the number of active users M > K. Hence we cannot directly invert the 
matrix A^Rj to remove interference, mimicking the idea of the decorrelating detector of conventional 
MUD. Alternately, we can alleviate the effect of interference using the method of least-squares (LS) to 
find an estimate of b^: bj = argminx ||y— A^R^xp, and detecting symbols as the signs of the estimated 
vector. The solution is given by hj- = ^^^{A^ Aj-)^^A^y. This corresponds to the maximum likelihood 
estimate of hf if (a) we ignore the covariance of the noise w in (20) and assume it is white; (b) we 
know the active users, i.e. X = X, and (c) the gains r„ are known. We call this the reduced-dimension 
least squares (RD-LS) detector. In summary, the RD-LS detector first detects active users by support 
recovery method of (22). Since sgn([b2;]n) = sgn([R|bj]„), the RD-LS detects symbols by: 

^ I sgn (r„5J [(Af A^)-iAf y] J , n 6 X; ^^^^ 
[ 0, n^i. 

We can show that the RD-LS detector (33) and the RDD detector (23) give quite similar results 
in low noise and with low coherence of A. To see this, write A^A^; = I + E, where the symmetric 
matrix E has zeros on the diagonal and the off-diagonals are bounded by the coherence /i of A. As 
discussed in more detail in Section IH-C, for the RDD detector to work well, we choose A with 
small /i. When (K — < 1, by Gershgorin's Theorem, we have that the spectral norm of the 
symmetric matrix E is bounded by p(E) < (K — l)yU < 1. Hence, using Lemma 4 in [ ], we can 
write (A^Aj)~^ = I + X]^i(~E)", and bound the spectral norm of the series by a small number: 
piEZii-m <{K- l)/i/[l -{K- 1)/.]. This means 

< {K-l)^^/[l-{K-l)^l]\\A'}y\\. (34) 

When the coherence /i of A is sufficiently small relative to K and ||A^y||, (34) says that the difference 
between the two vectors {A^ Aj-)~^ A^y and A^y is small. Then when ^ is small and with sufficiently 
small noise, detecting symbols using {A^Aj-Y^A^y in (33) is similar to detecting symbols using A^y 
in (23). As numerically shown in Section V-B3, the conditional probability of detecting wrong symbols 
given the correct support of active users, i.e. P(b ^ b|X = X), for (33) is similar to that for (23). 

5) Maximum likelihood detector: The optimal detector that minimizes the probability-of-error for 
the RD-MUD output is the nonlinear maximum likelihood detector. The maximum likelihood detector 



(AfA^; 



-^Afy-Afyl 
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finds the active users and symbols by minimizing the likelihood function, or, equivalently, minimizing 
the quadratic function ||(AG^^A^)^^/^(y — ARb)||^. This is equivalent to solving the following integer 
optimization problem 



where 6„ = corresponds to the nth user being inactive and this creates an augmented state space 
with one more state of possible transmitted symbols corresponding to a null symbol "0". Hence (35) 
is more complex than the conventional maximum likelihood detector for MUD with BPSK modulation 
(10) since we add the "0" symbol. Similar to the conventional maximum likelihood detector of the 
conventional MF-bank, the maximization in (35) is a combinatorial optimization problem, which can 
be solved by exhaustive search with complexity-per-bit exponential in the number of users. 

C. Choice of A 

In Section III-A we have shown that the coefficient matrix A is our design parameter. In Section 
III-Bl and Section III-B2 we have shown that both the RDD and RDDF detectors are based on the 
inner products between the projected received signal vector and the columns of A, which correspond to 
the signature waveform vectors in the detection subspace. Hence, intuitively, for the RDD and RDDF 
detectors to work well, the inner products between columns of A, or its coherence defined in (14), should 
be small, since each column of A represent a signature waveform vector in the detection subspace. 
Several commonly used random matrices in compressed sensing that have small coherence with high 
probability are: 

(1) Gaussian random matrices: entries a„m are independent and identically distributed (i.i.d.) with a 
zero mean and unit variance Gaussian distribution, with columns normalized to have unit norm; 

(2) Randomly sampled rows of an unitary matrix that satisfies XX^ = X^X = I. For instance, the 
random partial discrete Fourier transform (DFT) matrix, which is formed by randomly selecting 
rows of a DFT matrix F: [F]„m = e*^"*" and normalizing the columns of the sub-matrix, where 




We will focus on the random partial DFT matrix for the following reason. If we choose the number 
of correlators equal to the number of users, i.e. M = N, there is no dimension reduction, and the 
performance of RD-MUD should equal that of the MF-bank. When M = N, the random partial DFT 



b„e{-i,o,i} 



max 2y^(AG"^A^)-^ARb - b^RA^(AG"^A^)-^ARb, 



(35) 
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matrix becomes the DFT matrix with the property that A^A = I, i.e, a^a^ = 5nm- Consequently, in this 
case, {a^y}, which is a set of statistics that the RDD and RDDF detectors are based on, has the same 
distribution as the MF-bank output. To see this, write a^y = a^ (j2m=i ^mrmbm^ +^n^ = ^n^n +a^w, 
where a^w is a Gaussian random variable with zero mean and covariance cr^a ^AG^^A^a^ = [G-^]„^. 
However, the Gaussian random matrix does not have this property: when M = N, a^a^ 7^ for n ^ m, 
and so the performance of RD-MUD using the Gaussian random matrix A is worse than that using the 
random partial DFT matrix. This has been validated in our numerical results in Section V-A3 where 
we will show that when M is relatively large, the Gaussian random matrix performs worse than the 
random partial DFT matrix. 

IV. Performance of RD-MUD 

In the following, we study the performance of RD-MUD with the RDD and RDDF detectors. We 
begin by considering the scenario of a single active user without noise, and then move on to analyze 
the more general scenario with multiple active users and noise. 

A. Single Active User 

The following discussion shows that, when there is only one active user in the absence of noise, 
the RDD detector can detect the correct active user and symbol by using only two correlators, if the 
columns of A are linearly independent. Later we will also show this is a corollary (Corollary 2) from 
the more general Theorem 1. 

Assume there is no noise and only one user with index uq is active. In this case y{t) = r„(,6„QS„,)(t), 
and by assumption we know only one user is active, i.e. we know K = 1. In the conventional MUD, the 
single-user detector based on the MF-bank detects the active user by finding hq = arg max„ | {y{t), s„(t)) | 
and the symbol by 6fio = sgn(r^iQ(?/(t), Sno(i)))- From the Cauchy-Schwarz inequality, for any n, 

\{y{t),Snit))\ = \{rnobnoSnoit),Snit))\ < |r„J || s^^ (t) || || || = |r„J, (36) 

with equality if and only if s„(t) = cSnoit) for some constant c, and thus no = ^o- The symbol can 
also be recovered perfectly, since 

bno = sgn(r„o(?/(t),s„o(t))) = sgn(r^ 6„o(Sno(^), s„o(t))) = &„o- (37) 
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In RD-MUD, with two correlators, the RDD detector determines the active user by finding 

no = arg max \ain{hi{t),y{t)) + a2n{h2{t),y{t))\. (38) 

n=l,--- ,N 

From the Cauchy-Schwarz inequality, 

\ain{hi{t), y{t)) + a2„(/i2(t), l/(t))r < (a^ + a^) [{h^{t),y{t)Y + {h^it), y{t)Y] , (39) 

with equality if and only if a„„ = c{h^{t),y{t)) = ca^nofnoKo = c{no)a^no for both m = 1,2 with 
some constant c(no). If the columns of A are linearly independent, we cannot have two indices n such 
that = c(?2o)amno for m = 1,2. Also recall that the columns of A are normalized, + a^^ = 
||an|p = 1. Therefore, the maximum is achieved only for n = uq and c(no) = 1, which detects the 
correct active user. The detected symbol is also correct, since 

K = sgn(r„Jai„,(?/(t), hi{t)) + a2„o(y(t), h2{t))]) = sgn(r2^6„Ja2„^ + a^^J) = (40) 

In the presence of noise, detectors in RD-MUD as well as those based on the conventional MF-bank 
will make detection errors. However, RD-MUD can have a performance similar to the detectors based 
on the MF-bank, as we now explain using a geometric intuition and later prove formally in Section 
IV-C. Consider a scenario with three users having orthogonal signature waveforms, where only the first 
user is active. Suppose that bi = 1 and ri = 1, as illustrated in Fig. 4. By correlating with signature 
waveforms, the MF-bank (Figure 4a) obtains inner products (5) of the received signal y{t) with each 
of the signature waveforms and detects based on these inner products. Because of noise, in 

Fig. 4a the received signal y(t) does not coincide with si{t). However, when the noise is sufficiently 
small, as shown in Fig. 4a, the inner product of the received signal with the first signature waveform is 
the largest and positive, and hence in this scenario the single-user detector based on the conventional 
MF-bank detects the correct active user and its symbol. On the other hand, for the same setting, the 
RDD detector with two correlators (Fig. 4b) projects the received signal onto the detection subspace via 
(17), and then obtains the decision statistics by computing the inner product between y, the mapping 
of the received signal onto the subspace, and a„, the projection of each signature waveform onto the 
subspace. Because of noise, in Fig. 4b the projected signal vector y does not coincide with the first 
signature waveform vector Hi. However, when the noise is sufficiently small, as shown in Fig. 4b, the 
inner product with the first signature vector is still the largest and positive, and hence in this scenario 
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the RDD detector also detects the correct active user and symbol. From the above discussion we see that 
the RDD detector works well when the columns of A are nearly orthogonal and the noise is sufficiently 
small. The former requirement is equivalent to requiring the coherence of A to be as small as possible. 
The above discussion applies to the RDDF detector as well since its detection is also based on the inner 
products in the projection space. 




(a) MF Bank (b) RD-MUD 

Fig. 4: A MUD problem with N = 3, M = 2, K = l and the received signal is due to the first user. 
Illustration of the detection by (a) the single-user detection of the MF-bank, and (b) the RD-MUD. The 
projection of the signature waveform onto the projection subspace results in signature waveform vectors 
{s„}. 

B. Noise Amplification of Subspace Projection 

The RDD and RDDF detectors use the set of statistics {a^y} to detect active users and their symbols, 
which has noise components {a^w}. We will show that the projection onto the detection subspace 
amplifies noise. To see this, first consider a special case with orthogonal signature waveforms, i.e. 
G = I, and hence the noise amplification is not caused by correlated signature waveforms. Assume 
the random partial DFT matrix is used as the coefficient matrix A. Using (19), the noise variance of 
the detection statistic for the nth user is given by (T^a^AA^a„ = a'^{N/M) by the definition of the 
random partial DFT matrix in Section III-C. Hence in this special case the noise variance for each user 
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is amplified by a factor N/M > 1 due to subspace projection. 

In general, from (19) the detection statistic for the rath user has a noise variance cr^a^AG^^A^a„. 
We will show this noise variance can be greater than o"^. First we bound the output noise variance of 
the RD-MUD for each user as 

a'A^i„(G-i)||AVf < a^a^AG-^A^, < K^.,^{G-^)\\X'' . (41) 

We now show that the upper bound in (41) is greater than a^. The factor Amax(G"^) captures the largest 
possible noise amplification due to correlated signature waveforms. We have Amax(G"^) > 1 for the 
following. The sum of all the eigenvalues X]^iAfc(G) = tr(G) = since all the diagonal entries 
of G are one. Since all eigenvalues of G are nonnegative (a property of G is that it is positive semi- 
definite), if Ainm(G) > 1, the sum of the eigenvalues will exceed N, and hence Amin(G) < 1. Since the 
eigenvalues of G^^ are the inverse of the eigenvalues of G, Amax(G^^) > 1. On the other hand, the 
factor ||A^a„|p captures the noise amplification due to subspace projection. It is lower-bounded by 1: 
IIA'^anll^ = l + ijLni^^^nY ^ l+min^^^ |af^a„p. When M < N, the inner product min^^^ \^i^^n\ 
is non-zero, and hence the factor ||A^a„|p will be strictly greater than one. As a result, the upper bound 
on the noise variance in (41) is greater than cr^. 

In the following section, we will capture this noise amplification more precisely by relating the noise 
variance of the decision statistic to the performance of the RD-MUD detectors. 

C. Coherence Based Performance Guarantee 

In this section, we present conditions under which the RDD and RDDF detectors can successfully 
recover active users and their symbols. The conditions depend on A through its coherence and are 
parameterized by the crosscorrelations of the signature waveform through the properties of the matrix 
G. Our performance measure is the probability-of-error, which is defined as the chance of the event 
that the set of active users is detected incorrectly, or any of their symbols are detected incorrectly: 

Pe = P{± 7^ X) + P({x = X} n {b ^ b}). (42) 

We will show in the later section that the second term of (42) is dominated by the first term when (22) 
and (26) are used for active user detection. The noise plays two roles in the Pe of (42). First, the noise 
can be sufficiently large relative to the weakest signal such that a nonactive user is determined as active; 
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second, the noise can be sufficiently large such that the transmitted symbol plus noise is detected in an 
incorrect decision region and hence decoded in error. 

The first error term in (42) is related to the probability-of-error for support recovery (see, e.g. [jj] 
in noise, and the first steps of the greedy algorithms to recover support [6^]). There are two major 
differences in our results on this aspect of RD-MUD performance relative to those previous works. 
First, although noise in the analog signal model (4) is white, matched filtering at the RD-MUD front- 
end introduces colored noise in (19). Second, we take into account the second term in (42), which 
has not been considered in previous work. We find the conditions such that the second term of (42) is 
dominated by the first term of (42). 

Define the largest and smallest channel gains as 

I l^^ll I l^^ll ^/l-2^ 
Tmaxl = max |r„|, |rmin| = mm |r„|. (43) 

n=l n=l 

Our main result is the following theorem: 

Theorem 1. Let b G M^^^ be an unknown deterministic symbol, b„ G { — 1, 1}, n e X, and 6„ = 0, 
n G n = 1, ■ ■ ■ , A^. Assume that the number of active users K is known. Given the RD-MUD 
front-end output y = ARb + w, where A G C*^^^ and G G M^^^ are known, and w is a Gaussian 
random vector with zero mean and covariance a^AG^^A^ , if the columns of A are linearly independent 
and the coherence of A (14) satisfies the following condition: 

Ir^iJ - i2K - 1)^1 wl > 2av/2(l + «)logiV ■ ^/kJg-^) ■ ^max (a^AA^a,), (44) 

for some constant a > 0, and A^~''^"*'"''[7r(l + a) log A^]^^/^ < 1, then the probability-of-error (42) for 
the RDD detector is upper bounded as: 

Pe < A^~"[7r(l + a) \ogN]~^/\ (45) 

If the columns of A are linearly independent and the coherence of A (14) satisfies a weaker condition: 

Ir^inl - {2K - l)/i|r^in| > 2a^2{l + a)\ogN ■ ^JxZJg^) ■ ^max (a ^AA^a„) , (46) 

for some constant a > 0, and A^~'^^+°)[7r(l + a)logA^]^^/^ < 1, then the probability-of-error (42) for 
the RDDF detector is upper bounded by the right hand side of (45). 
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Proof: See Appendix C. ■ 

Note in Theorem 1 that the condition of having a small probability-of-error for the RDDF detector 
is weaker than for the RDD detector. Intuitively, the iterative approach of decision feedback removes 
the effect of the largest element in Rb iteratively, which helps the detection of weaker users. This is an 
extension of the ideas in SIC for standard MUD, except that now the presence as well as the data of 
the strongest user is detected, then removed in the detection subspace, which makes it easier to detect 
the presence and data of the next strongest user, after which the process repeats. 

The main idea of the proof is the following. Consider 1 — Pe = P({X = X} fl {b = b}). First we 
define an event Q = {max„ |a^w| < r} for a quantity r proportional to the right hand side in (44), and 
prove that Q occurs with high probability. This bounds the probability that the noise projected onto the 
detection subspace exceeds r, i.e. it bounds the tail probability of the projected noise. Then we show 
that under the condition (44), whenever Q occurs, the active users can be correctly detected, which 
means Q C {X = X}. On the other hand, we show that under a condition weaker than (44), whenever Q 
occurs, the user data symbols can be correctly detected, which means Q C {6„ = G X}. In other 
words, condition (44) ensures that whenever Q occurs, both the correct set of active users are detected 
and that their data are correctly decoded. This means that under condition (44) for the RDD detector, 
^ C {X = X} n {b = b}, and thus P{g) < P{{i = X} n {b = b}), which concludes the proof. A 
similar but inductive approach is used to prove the performance guarantee for the RDDF detector. 

A special case for Theorem 1 is when AA^ = {N/M)l, max„(af AA^a„) = N/M and G = I, 
Amax(G^^) = 1. This is true when A is the random partial DFT matrix and the signature waveforms are 
orthogonal, and hence the noise in (19) is white. If we scale cr^ by M/N, the right hand sides of (44) 
and (46) are then identical to the corresponding quantities in Theorem 4 of [65]. Hence, for the random 
partial DFT matrix for A, Theorem 1 has the same conditions as those of Theorem 4 in [65]. However, 
Theorem 4 in [65] guarantees detecting the correct sparsity pattern of b (equivalently, the correct active 
users), whereas Theorem 1 guarantees correct detection of not only the active users but their symbols 
as well. That is because, as mentioned above, correct detection of these transmitted symbols comes "for 
free" when the conditions to correctly detect the active users are met. 

Remarks: 

The term max„(a^AA^a„) on the right hand side of (44) and (46) is bounded by 

1 < max(a^AA^a„) < 1 + (iV - (47) 
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Equation (47) follows because max„ (a^AA^a„) = max„ ^^-^(3^3^)^, and 

AT 

1 = (af a„)2 < max V(a^aO' = 1 + max V(af a^)' < 1 + {N - (48) 

n ' n ' 

On the other hand, there is a noise phase-transition effect, in the following sense. Conditions (44) and 
(46) suggest that for the RDD and RDDF detectors to have Pg as small as (45), we need to have 

kminlV^' > 81ogiVA^ax(G-^), (49) 

because a > and (47) holds. If the minimum SNR, i.e. the SNR associated with the minimum gain 
Tmin is not Sufficiently high, these algorithms cannot attain small probability-of-error. We illustrate this 
effect via numerical examples in Section V-Bl (a similar effect can be observed in standard MUD 
detectors using linear or DF detection). 

D. Bounding probability-of-error of RDD and RDDF 

Theorem 1 provides a condition on how small /x has to be to achieve a small probability-of-error. The 
condition and the achievable small probability-of-error are related by the constant a. We can eliminate 
this constant and write Theorem 1 in an equivalent form that gives error bounds for the RDD and RDDF 
detectors explicitly. Define the minimum signal-to-noise ratio (SNR) in the projection subspace as 

where the factor Amax(G^^) captures the noise amplification effect in the projection due to nonorthogonal 
signature waveforms. Also define two factors (3i and (32 as 

A [l-(2ir-l)/i|w|/Kin|p ^ ^ [1 - i2K - l)/i]^ ^ ^^^^ 

max„(a^AA^a„) ' max„(a^AA^a„) 

For the RDD detector, we have already implicitly assumed that 1 — (2K — l)/u|rmax|/|^min| > 0, since 
the right hand side of (44) in Theorem 1 is non-negative. For the same reason, for the RDDF detector, 
we have assumed that 1 - {2K - l)fi > 0. By (47) and (51), (3i < 1 and j32 < 1. We have the following 
corollary from Theorem 1: 

Corollary 1. Under the setting of Theorem 1, with the definitions (50) and (51), the probability-of-error 
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for the RDD detector is upper-bounded by 



P < 2^ 

-T e,RDD ^ 

'-n 



SNRmin n 
7^ ■ Pi 



-1/2 

1 SNR ; „ 

e—i— — ^\ (52) 



with 1 — {2K — l)/i|rmax|/kmin| > 0, Qud the probability-of-error for the RDDF detector is upper 
bounded by 



-Pe.RDDF < 



2N 



SNRmin n 
7^ ■ P2 



-1/2 

_lSNR^ 

■ e 4 2 P2 (53) 



wjY/j 1-{2K - > 0. 



Proof: We begin by bounding the probability-of-error of the RDD detector from Theorem 1 . Under 
condition (44), by (45) the probability-of-error of the RDD detector is bounded by iV""[7r log(A^^+")]"^/2 
for some constant a > 0. To make the bound tight, we choose a as large as possible such that it still 
satisfies (44). With definitions (50) and (51), for the RDD detector, we can rewrite (44) as 



[logiv(i+°)]V2< 1 



SNRmin 

— — ■ Pi 



1/2 

(54) 



or equivalently 



N'^ < N-^^'^-^K (55) 



The right hand sides of (54) and (55) are the largest values for [log A^(^+°)]^/^ and A^" we can obtain 
under (44), for given SNRmin, A, K and A^. Combining (54) and (55) in (45), we have (52). Similarly, 
by choosing the largest possible a satisfying (46), we derive the bound (53) on the probabihty-of-error 
of the RDDF detector. ■ 
Remarks: 

For bounds (52) and (53) to be meaningful, they have to be less than one, and hence SNRmin should 
be on the order of log A^. Also note that the error bounds (52) and (53) for the RD-MUD detectors are 
larger when the signature waveforms are correlated, since these error bounds increase in SNRmin, and 
SNRjnin decreases in Aniax(G^^). This implies that the performance of the RD-MUD detectors tend to 
perform worse when signature waveforms are nonorthogonal (the same effect can also be observed for 
the conventional MUD detectors based on the MF-bank). 

By letting the noise variance go to zero in (52) and (53) for the RDD and RDDF detectors, we 
can derive the following corollary from Theorem 1 (another proof for the RDD detector in this case 
has been given in Section IV- A). 
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Corollary 2. Under the setting of Theorem 1, in the absence of noise, the RDD detector can correctly 
detect the active users and their symbols if jJ^ < |?"mm|/[|'"max|(2-ft' — 1)], and the RDDF detector can 
correctly detect the active users and their symbols if ^ < l/(2fC — !)• In particular, if K = 1, with 
M = 2 correlators, Pe = for the RDDF detector, and if furthermore |rmax| = l^'mml, Pe = for the 
RDD detector (which has also been shown in Section IV-A). 

Proof: In Theorem 1, if we let cr^ go to zero, then SNRmin goes to infinity, and the right hand 
sides of both (52) and (53) go to zero, i.e. Pe = 0, as long as /3i > and (32 > 0, or equivalently, 
1 - {2K - l)|ri„ax|/kmm| > and 1 - {2K - > 0. When K = 1, the bound on /i for the RDDF 
detector becomes 1/{2K — 1) = 1, which is satisfied for any /i as long as M > 2 (since Theorem 1 
also requires linear independence of the columns of A and this rules out the possibility of M = 1). ■ 



E. Comparison with existing bounds 

In this section we compare the bound on the probability-of-error in the literature for the decorrelating 
detector of the conventional MUD with our bound for the RDD and RDDF detectors derived from 
Theorem 1 . The decorrelating detector is the counterpart of the RDD detector in the conventional MUD 
setting. To see this, note that for the RD-MUD front-end, when M = N, we can choose the coefficient 
matrix A = I such that the output data model (19) is equivalent to the MF-bank decorrelating detector 
(7) (with T = G"^). 

For the decorrelating detector of the conventional MUD, a commonly used performance measure is 
the probability of error of each user [71][8], which is given by [71] [3]: 



(56) 



where Q{x) = /^°°(l/v^27r)e~^^/^(iz is the Gaussian tail probability. To compare (56) with the P^ bound 
defined by (42), which consists of both active user detection error and symbol error, we consider the 
case when all uses are active, i.e. K = N, and then Pg is only due to symbol error. In this setting, we 
have, using the union bound and (56): 



Pe = P{h y^h)< Y,P{hn ^ 6„) < NQ (^^/MR~^ < ^ 
n=l ^ 



SNR, 



(57) 



where we have also used the fact that |r„|/ o\J[G > a/SNR~ and Q{x) is decreasing in x, as 
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well as the bound on Q{x) [ ] on Q{x) given by 



Q{x) < ^e-^'/l (58) 



The bounds on Pg of the RDD and RDDF detectors when 1 < K < N are given in (52) and (53), 
respectively. Since /3i < 1 and (32 < 1, the error bounds (52) for the RDD detector and (53) for the 
RDDF detector are larger than the bound (57) for conventional MUD. This is because the RDD and 
RDDF detectors have one extra source of error from detecting the wrong set of active users, and also 
because the noise can be amplified by the projection onto the detection subspace, as discussed in Section 
IV-B. The enlargement of the error bound due to subspace projection is captured by factors (3i and (32 
for the RDD and RDDF detectors, respectively. These factors reduce the effect of SNRmin in the bounds. 
Note that (3i and (32 increase in fi, and hence we want small fi, which leads to a small error bound for 
RD-MUD detectors. 

A special case is when K = N, A = 1. Then fi = 0, max„(a„AA^a„) = 1, and by definition (51), 
(3i = (32 = 1. At the beginning of this section, we have shown that this correspond to the decorrelating 
detector of the conventional MUD. In this case the bounds (53) and (57) become the same expression 



1/2 

(59) 

Compared with the bound (57) for Pg of the conventional decorrelating detector, the bound (59) obtained 
from our result is larger. This can be explained since (59) is obtained as a special case of RD-MUD 
which must also detect active users. As we have shown in the proof for Theorem 1, the error from 
detecting active users dominates the error from detecting symbols. 



F. Lower Bound on the Number of Correlators 

Theorem 1 is stated for any matrix A. If we substitute the expression for coherence of a given A 
in terms of its dimensions M and into Theorem 1, we can obtain a lower bound on the smallest 
number of correlators M needed to achieve a certain probability-of-error. 

There is a general lower bound on the coherence of any M x N matrix A given by [72]: 

1/2 / M \ 1/2 

= / \ . _ M/Nf' ~ M-'/\ (60) 

when is large relative to M and N is much larger than 1 . In the absence of noise, the upper bound 



N-M 

M{N - 1) 
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on the coherence in Corollary 2 together with the bound (60) imply that, for the RDDF detector to 
have perfect detection, the number of correlators M should be on the order of {2K — 1)^. In the 
compressed sensing literature, it is known that the bounds obtained using the coherence of the matrix A 
may not be as sharp as those obtained using the restricted isometry properties of A [72]. For example, 
in compressed sensing, to estimate a sparse vector with k non-zero entries, the lower bound on the 
number of measurements obtained from the coherence of A is proportional to k^, while that obtained 
from the restricted isometry properties is proportional to k. This effect is referred to as the "quadratic 
bottleneck" in the compressed sensing literature { l\. Nevertheless, the coherence properties are easy 
to evaluate, while evaluating the restricted isometry property of a given matrix A is in general NP-hard 
[65]. Also as we demonstrate in the proof of Theorem 1, the coherence is a convenient measure of the 
user interference level in the detection subspace. For this reason, our result is based on the coherence 
of matrix A. 

In the compressed sensing literature, the matrix A is often chosen to be random, in which case its 
coherence can be bounded in probability. Consider for example a random partial DFT matrix. We have 
the following result (which can be proven easily by the complex Hoeffding's inequality [~^]): 

Lemma 1. Let A G C*^^^ be a random partial DFT matrix. Then the coherence of A is bounded by 

/i< [4(21ogAr + c)/M]'/% (61) 

with probability exceeding 1 — 2e~^ for some constant c > 0. 

Using Lemma 1, we have the following corollary to Theorem 1: 

Corollary 3. Consider the setting of Theorem 1, where A is a random partial DFT matrix A. Suppose 
the number of correlators satisfies the following lower bound for the RDD detector 



M> A 



{2K-l)\r^J^^ 



\r ■ — 2t 

I ' mm I ' 

or satisfies the following smaller lower bound for the RDDF detector 



(2 log iV + c), (62) 



M > 4 



(2/^-l)|r„,i„'^' 



(2 log iV + c), (63) 



I '"mini 2t 

for some constants c > and a > 0, and |rmin| > 2r, for t defined in (70). Then the probability-of-error 
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Pg of the RDD detector or the RDDF detector is bounded by 

1 - (1 - A^-"[7r(l + a) logiV]"^/2)(l - 26"'=), (64) 

for some constant a > 0. 

This corollary says that to attain a small probability-of-error, the number of correlators needed by 
the RDD and RDDF detectors is on the order of logA^, which is much smaller than that required by 
the conventional MUD using a MF-bank, which is on the order of A^. 

V. Numerical Examples 

As an illustration of the performance of RD-MUD, we present some numerical examples mainly of 
both the RDD and RDDF detectors. The results are obtained from 10^ Monte Carlo trials. For each 
trial, we generate a Gaussian random noise vector as well as a random partial DFT matrix for A, and 
form the signal vector according to (19). To simplify, we assume that the gains for all the users are 
the same: |rmin| = |rmax| = r = 1. First we consider noise-free scenarios with an increasing number 
of users for a fixed number of active users K, and then with increasing K for a fixed A^. Next we 
consider two noisy scenarios with orthogonal waveforms G = I and nonorthogonal waveforms G 7^ I. 

A. Noise -Free Scenario 

In the absence of noise, from (7) of the MF-bank, the conventional decorrelating detector has output 
z = Rb. The conventional decorrelating detector determines the active users by choosing the K largest 
of \zn\, which is equivalent to choosing the K largest of {|r„6„|} in the absence of noise. Recall that the 
inactive users have = 0, which means the conventional decorrelating detector can correctly detect the 
active users. It then detects symbols by 6„ = sgn(r^6„) = 6„. For the above reasons, the conventional 
decorrelating detector has Pe = in the absence of noise. 

1) Pe vs. M, as N increases: Fig. 5 shows the Pe of the RDD detector as a function of M , for fixed 
K = 2, and different values of N. The data points marked on the curves correspond to M = logA^, 
2 log A^, and 4 log A^. When M = 8 log A^, Pe = for the RDD detector for all values of A^. This example 
clearly demonstrates the logA^ scaling factor for the required number of correlators in Corollary 3. 

2) Pe vs M, as K increases: Fig. 6a demonstrates the Pe of the RDD detector as a function of 
M, for a fixed A^ = 100, and different values of K. The points marked on the curves correspond to 
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Fig. 5: In the absence of noise, Pg versus M of the RDD detector, for K = 2 and different A^. The 
marked points on the curves are equal to log A^, 2 log A^, and 4 log A^. When M = 8 log N, Pf. = for 
the RDD detector. 

M = {K\ogN)/2, K\ogN, 1.5K\ogN, 2K\ogN and min{A^, S/sTlog A^}. Clearly, the number of 
correlators needed to obtain Pg < 10^^ increases as K increases. When K = 10, the RDD detector 
needs about 80 correlators to obtain Pg < 10^"^. The number of correlators needed to achieve a small 
probability-of-error can be improved by using the RDDF detector. As shown in Fig. 6b, when K = 10, 
the RDDF detector uses only 60 correlators to obtain Pg < 10""^. 

Fig. 6a also demonstrates how Corollary 2 can be used to estimate the number of correlators needed 
to achieve a small probability-of-error. Corollary 2 says that we need fi < 1/{2K — 1) in the absence 
of noise to have perfect detection. When K = 2, this requires /i < 1/3. We then obtain an estimate 
for coherence of the random partial DFT matrix with A^ = 100 and various M, by averaging over 10^ 
trials, and find that when M is about 30 the coherence is less than 1/3. This is consistent with Fig. 6a, 
which shows that when M = 28, Pg is on the order of 10"^. 

3) Random partial DFT vs. Gaussian random matrices: We compare the performance of the RDD 
detector using the random partial DFT matrix versus using the Gaussian random matrix for A (defined 
in Section III-C). In Fig. 7, the probability-of-error of the Gaussian random matrix converges to a value 
much higher than zero, whereas that of the random partial DFT matrix converges to zero (the value 
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N = 100, M = Klog(N)/2, Klog(N), 




M: # of Correlators 

(a) Pe vs. M for different K, RDD. (b) Pe vs. M for K = 10, RDD and RDDF. 

Fig. 6: In the absence of noise, Pg versus M for = 100, using (a): the RDD detector for different 
K, and (b): the RDDF detector compared with the RDD detector when K = 10. 



achieved by the conventional decorrelating detector), when M increases to A^. 
B. Noisy Scenario 

Next, we consider noisy scenarios. For comparison, we also consider the conventional decorrelating 
detector, which corresponds to the RDD detector with M = as we explained in Section IV-E. 

1) Pe vs. M, as SNR increases, G = I: We study Pg versus M for the RDD detector as SNR increases 
when the signature waveforms are orthogonal and G = I and hence the noise in (19) is white. In this 
case SNRmin = IrmmP/o"^ = r^/cr^, which is denoted as SNR in Fig. 8. Assume = 100 and K = 2. 
In Fig. 8, when SNR increases, the curves converge to the noise-free curve for K = 2 shown in Fig. 
6a, and to the noise-free curve for A^ = 100 shown in Fig. 5. Note that there is a noise phase-transition 
effect in Fig. 6a, which is discussed in the Remarks of Section IV-C. The analysis in (49) implies that 
for A^ = 100 and G = I, we need SNR to be at least 15.7dB to obtain a small Pg, which is consistent 
with Fig. 6a. 
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N = 100, K = 6 




M: # of Correlators 



Fig. 7: In the absence of noise, Pe versus M of the RDD detector using the random partial DFT versus 
using the Gaussian random matrices for N = 100 and K = 6. 

2) Pf. vs. M, performance of the noise whitening transform, G ^ I: Next we consider a scenario 
when the signature waveforms are nonorthogonal G 7^ I. We generate an arbitrary symmetric G with 
ones on the diagonal and fix it in the Monte Carlo trials. In the first case we consider highly correlated 
signature waveforms with Amax(G^^) = 493.9595. In the second case we consider nearly orthogonal 
signature waveforms with Amax(G^^) = 4.0771. Then we compare the Pg of the RDD detector in 
these two cases without and with the noise whitening transform in Section III-B3. Assume N = 100, 
K = 2, r = 1 and a = 0.1. In Fig. 9a, when the signature waveforms are highly correlated, the noise 
whitening transform significantly reduces Pg for large M. In this case, the conventional decorrelating 
detector without the noise whitening transform has a non-negligible probability-of-error, and that with 
the noise whitening transform has a probability-of-error less than lO^''. In Fig. 9b, when the signature 
waveforms are nearly orthogonal, the noise whitening transform does not reduce Pg much. In this 
case, the conventional decorrelating detector without and with the noise whitening transform both have 
probability-of-error less than 10^"^. We also verified that using the noise whitening transform cannot 
achieve the probability-of-error that is obtained with orthogonal signature waveforms G = I. This is 
because the noise whitening transform distorts the signal component. 
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Fig. 8: Performance of the RDD detector, Pg versus M for different SNRs, when the signature waveforms 
are orthogonal, i.e. G = I. The dashed lines show Pg for the conventional decorrelating detectors at the 
corresponding SNR. When SNR is greater than 15 dB (with = 100 correlators), the probability-of- 
error of the decorrelating detector is less than 10^^. 

Jj Pe vs. M, RD-MUD linear detectors: In this example, we compare the performance of RD-MUD 
linear detectors when G 7^ I. In Theorem 1 we have proven that the error is dominated by that from 
active user detection. So we compare the performance of these RD-MUD linear detectors using their 
conditional probability of symbol error given the correct detection of active users P(b 7^ b|X = X). 
Assume = 100, = 2, r = 1, a = 0.1, and let G take the same form as those used in Fig. 9. In 
Fig. 10, the performance of the RDD detector is similar to that of the RD-LS detector (the explanation 
is given in Section III-B4). In comparison, the RD-MMSE detector has smaller conditional probability 
of error, especially in Fig. 10a with highly correlated signatures. This improvement is because the 
hnear transform (32) of the RD-MMSE detector alleviates the effect of correlated signature waveforms 
by including an inversion of G in the linear transform. The conditional probability-of-error of the 
conventional decorrelating detector is less than 10""^. 
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(a) Large Aniax(G highly correlated signature waveforms (b) Small Amax(G ^): nearly orthogonal signature waveforms 

Fig. 9: Comparison of versus M, without and with the noise whitening transform, when N = 100, 
K = 2, and for two different cases of G. The conventional decorrelating detector with the noise 
whitening transform in Fig. 9a, and the conventional decorrelating detectors with and without the noise 
whitening transform in Fig. 9b have probability-of-error less than 10~^. 

VI. Conclusions and Discussions 

We have developed a reduced dimension multiuser detection (RD-MUD) structure, which decreases 
the number of correlators at the front-end of a MUD receiver by exploiting the fact that the number of 
active users is typically much smaller than the total number of users in the system. Motivated by the 
idea of analog compressed sensing, the RD-MUD front-end projects the received signal onto a lower 
dimensional detection subspace by correlating the received signal with a set of correlating signals. The 
correlating signals are constructed as linear combinations of the signature waveforms using a coefficient 
matrix A, which determines the performance of RD-MUD and is our key design parameter. Based 
on the front-end output, RD-MUD detectors recover active users and their symbols in the detection 
subspace. We have studied in detail two such detectors. The reduced-dimension decorrelating (RDD) 
detector, which is a linear detector that combines subspace projection along with thresholding for active 
user detection and sign detection for data recovery. The reduced-dimension decision feedback (RDDF) 
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(a) Large Amax(G' ^): highly correlated signature waveforms (b) Small Amax(G' ^): nearly orthogonal signature waveforms 

Fig. 10: Comparison of the conditional probability of error P(b ^ b|X = I), for RD-MUD 
linear detectors: RDD, RD-LS, and RD-MMSE. The RD-MMSE detector has the smallest conditional 
probability of symbol error. 



detector is a nonlinear detector that combines decision-feedback orthogonal matching pursuit (DF-OMP) 
for active user detection with sign detection for data recovery. We have shown that to achieve a desired 
probability-of-error, the number of correlators used by the RD-MUD can be much smaller than that 
used by the conventional MUD, and the complexity-per-bit of the RD-MUD detectors are not higher 
than their counterpart in the conventional MUD setting. In particular, when the random partial DFT 
matrix is used for the coefficient matrix A and the RDD and RDDF detectors are used for detection, the 
RD-MUD front-end requires the number of correlators proportional to the log of the number of users, 
whereas the conventional MF-bank front-end requires the number of correlators equal to the number 
of users in the system. We have obtained theoretical performance guarantees for the RDD and RDDF 
detectors in terms of the coherence of A, which are validated via numerical examples. 

Our results are based on binary modulation and can be extended to higher order modulation with 
symbols taking more possible values. With higher order modulation, however, the conditions to guar- 
antee correct symbol detection may be stronger than the conditions to guarantee correct active user 
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detection. This general modulation set-up is further related to problems in compressed classification 
and compressed learning (see, e.g., [74]). We have also assumed that the signature waveforms are 
given. Better performance of RD-MUD might be obtained through joint optimization of the signature 
waveforms and the coefficient matrix A. 

Appendix A 

COVARIANCE OF RD-MUD OUTPUT NOISE 
Consider the covariance of the output noise at the nth and the mth branches in the RD-MUD front-end: 

N N 



L k=l 1=1 

N N 

= y^^y^^ankami^{{sk{t),w{t)){si{t),w{t))} 



(65) 



fe=i 1=1 

We want to show that pnm = o"^[AG"^A^]„m- We have 



-2 



(66) 



E{{sk{t),w{t)){si{t),w{t))} = T-' / h{t)si{u)E{w{t)w{u)}dtdu 

Jo Jo 

= T"M / Sk{t)si{u)a^6{t-u)dtdu = (r^T-^ Sk{t)si{t)dt 
Jo Jo Jo 

\ n m I n m 

= C'"^^^^^[G "'"]„fc[G ^]m/[G]nm = 0"^[G \k 
n m 

Substituting this back into (65), we have pnm = IE{w„Wm} = cr^ Ylk=i T.1L1 o-nkamilG'^ = a^fAG^^A^] 
Setting n = m, we have am = IE{w^} = cr^[AG"^A^]mm- This completes the proof. 

Appendix B 
Derivation of RD-MUD MMSE 

Proof: Given the active user index set X obtained from (22), we define W = AjR'^A^ +cr^AG~^A^, 
and M = R^aI'^W"^ We want to show that M = argminMlE{||bj — My|p}. Using the same method 
for deriving the conventional MMSE detector of the MF-bank [ ], we assume that bj has a distribution 
that is uncorrelated with the noise w and that E{bjb^} = I. Based on X, we refer to the model (20). 
Since \\xf = tr(xx^), we can write the MSB as E{||b^ - Myf} = tr(E{(b^ - My)(b^ - My)^}). 
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Now we expand 

E{(bi-My)(bi-My)^} 
= E{b^bf } - E{b^y^}M^ - ME{ybf } + ME{yy^}M^ (67) 
= I + M(A^R|Af + a^AG-^A")M^ - %Af - MA^%. 

It can be verified that MA^% = MWM . Hence from (67), we have 

E{(bj - My)(b^ - My)^} = I + MWM^ - MWM^ - MWM^ 

= I - MWM^ + (M - M)W(M - M)^ (68) 
= I - %Af W"^A^% + (M - M)W(M - M)^. 

Since W is a positive semidefinite matrix, the trace of the second term in (68) is always nonnegative. 
So we conclude that the matrix M that minimizes the MSE tr(E{(b;j - My)(b;j - My)^}) is M, or 
equivalently, E { 1 1 b ^ — My p } , as required. ■ 

Appendix C 
Proof of Theorem 1 

The proof of Theorem 1 for both the RDD and RDDF detectors are closely related. To exploit this 
similarity, we first prove several lemmas that are useful for both results. First, we will demonstrate that 
the random event 

g = \ max laf w| < r I (69) 

l^l<n<Ar' " ' J 

occurs with high probability, where 

r ^ a^/2il + a) logiV ■ ^J Xra..{G-') ■ ^max(a^AA^a„), (70) 

and a > 0. Then we show that when Q occurs, both algorithms can detect the active users and their 
symbols. The proofs follow the arguments in [65] with modifications to account for the fact that w is 
colored noise, and the error can also be caused by incorrect symbol detection. However, as we will 
show, the error probability of active user detection dominates the latter case. 
Sidak's lemma [ ] states the following: 

Lemma 2 (Sidak's lemma). Let [Xi, ■ ■ ■ , X„]^ be a vector of random variables having the n- dimensional 
normal distribution with zero means, arbitrary variances crl, - ■ ■ , o"^, and an arbitrary correlation matrix 
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\P\mk = Pmk- Then, for any positive numbers Ci, ■ ■ ■ , c„, 

P(|Xi| <Ci,|X2| <C2,-- - AXn\<Cn)>P{\Xi\<Ci)-P{\X2\<C2,---,\Xn\<Cn). (71) 

Lemma 3. Suppose that w is a Gaussian random vector with zero mean and covariance a'^AG~^A^ . 
If N~^^^°'^[7r(l + a) logA^]^^/^ < 1 for some a > 0, then the event Q of (69) occurs with probability 
at least one minus (45). 

Proof: The random variables {a^w}^^^ are jointly Gaussian, with means equal to zero, variances 
cr^ equal to a^a^AG^^A^a^, and covariances pnm between the nth and mth random variables equal 
to cT^a^^AG^^A^a™. Define 



T = (t[2(1 + a) \ogN]^/^ ■ Lax(a^AG"^A^a„)l 



(72) 



and an event 



{ max |a^w| < f ^ . (73) 



Using Sidak's lemma, we have 

N 

p{g)=P (|af w| < f, ■ ■ ■ , |a>| < r) > n < r). (74) 

n=l 

Since a^w is a Gaussian random variable with zero mean and variance a^, the tail probability of the 
colored noise can be written as 

P(|a>|<f) = l-2g(^^^. (75) 
By the bound (58) on Q{x), (75) can be bounded as 

P(|a>| <f) >l-r]n. (76) 

where rfn = ■ ^e"^^/^^''"). Define 

,1 1/2 



max(T„ = a 



n 



max(a^AG~^A^a„ 

. n 



5 



A / 2 0"max -t^/(2ct^ ) 

rn 4 / ^ / y max/ 

'/max — \ ^ 

V TT r 



(77) 
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Since a^ax/r = [2(l+a) \ogN]~^/^ by the definition off, we have 7]^^^ = y^[2(l+a) log A^]-i/2g-(i+a) 
It is easy to show that ?7„ increases as cr„ increases. Hence ?7„ < ?7rnax- When r^max < 1, we can use the 
inequality (1 — x)^ > 1 — Nx when x > and substitute the value of r^^ax to write (74) as 

N 

P{Q) > - no) > (1 - W)"^ > 1 - = 1 - iV"°[vr(l + a) \ogNY^I\ (78) 

n=l 

which holds for any r/max < 1 and > 1. 
Next we show that r < r. Note that 

AG-^aV < ||A^a„||2A„,ax(G-i) < [max (a^AA^a„)]A^ax(G-i). (79) 

n 

From inequality (79) and definitions (70) for r and (72) for f, we obtain f < r. Hence 

P(^) = P(max |a^w| < r) > P(max |a^w| < f) = P(^). (80) 

n n 

Combining (78) and (80), we conclude that P{G) is greater than one minus the expression (45), as 
required. ■ 
The next lemma shows that, under appropriate conditions, ranking the inner products between a„ 
and y is an effective method of detecting the set of active users. The proof of this lemma is adapted 
from Lemma 3 in [ ] to account for the fact that the signal vector y here can be complex as A 
can be complex. Since only the real part contains all the useful information, to prove this lemma, we 
basically follow the proof for Lemma 3 in [ ] while using the following inequality whenever needed: 
|5J[afaJ| < |a^a„| < for n ^ m, and |3?[afw]| < |a^w|. 

Lemma 4. Let b be a vector with support I which consists of K active users, and let y = ARb + w 
for a Gaussian noise vector w with zero mean and covariance AG^^A^. Define |rmax| ^^nJ |rmin| as in 
(43), and suppose that 

|r^in|-(2K-l)/i|w| >2r. (81) 
Then, if the event Q of (69) occurs, we have 

min|3ft[a^j]| >max|3ft[a^fj]|. (82) 
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If, rather than (81), a weaker condition holds: 



{2K - l)/i|r^ax| > 2r. 



(83) 



Then, if the event Q of (69) occurs, we have 



maxlJJfafj]! >max|3?[af3']|. 
Proof We begin by deriving a lower-bound for min^gj |3?[a^y]| under the event Q: 



min |3?[a,^y]| = min 



> min \hn\\rn\ 

> |r 



mm I max 



■miirmllaf a^l - max |a^w| 



(84) 



(85) 



> Ir^inl -{K - l)/x|rmax| - 

where we have used the triangle inequality, the fact that \hn\ = 1, |rmin| < kn| < kmax|, |3?[a^am]| < 
|a^am| < /i for n 7^ m, and |3?[a^w]| < |a^w|. On the other hand, we can similarly expand and 
upper-bound max„^x |]R[a^y]|, under the event Q, as 



max |3?[a^y] | = max 



5^6^n^3ft[afa„]+3fJ[a>] 



<max |6^||r^||3fJ[a^^am]| +max|3?[afw] 

n^X n^X 
m£X 



(86) 



Combining (85) and (86), we have that under the event Q, 



min |3fJ[af y]| > |r^in| - {2K - l)fi\r^^.\ - 2r + max |3fJ[a^y] 



(87) 



So when Q occurs, under the condition (81), we obtain (82). 

Similarly, when Q occurs, we expand and lower-bound max„£j |9f?[a^y]|. Assume that no is the index 
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achieving the largest absolute gain: Ir^^l = [rmaxl- Then under event Q: 
max|3i[a^y]|>|5J[af„y]| 



] + 3f?[a^gW] 

> Iwl - Yl \bm\\rmm^^,^m]\ " |3?Kw]| (88) 

> kmaxi -{K - l)/i|rmax| " ^• 

Combining (88) and (86), we have that under the event Q, 

max |3f?[a^y]| > |r^ax| - (2ir - l)/i|r^ax| - 2r + max |5R[a,f y]|. (89) 

So when Q occurs, under the condition (83), we obtain (84), as required. ■ 
The following lemma demonstrates that the sign detector can effectively detect transmitted symbols 
for the RDD and RDDF detectors. 

Lemma 5. Let b be a vector with bn G {1,-1}, for n & I and bn = otherwise, and let y = ARb + w 
for a Gaussian noise vector w with zero mean and covariance o'^AG^^A^ . Suppose that 

|rmin|-(i^-l)/i|w| >r. (90) 

Then, if the event Q occurs, we have 

sgn(r„3f?[a5]) = 6„, neX. (91) 
If, instead of (90), a weaker condition 

kmaxi + kmini " 2(K - l)^l\r^^y\ > 2t (92) 

holds, then under the event Q, we have 

sgn(r„,3ft[aj3']) =&ni, (93) 

for 

Til = arg max 1 3? [a^y] \ . (94) 

n 

Proof: To detect correctly, for 6„ = 1, 3ft[r„a^y] has to be positive, and for 6„ = — 1, 3ft[r„a^y] 
has to be negative. First assume 6„ = 1. We expand 3fJ[r„a^y], find the lower-bound and the condition 
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such that the lower bound is positive. Substituting in the expression for y, using the inequality that 
x + y + z>x — \y\ — \z\, under the event Q, we obtain 



3?[r„af y] = [r^p + J] 6^r„r^3? [a^a^] + rjt [af w] 



^ ^min 



knIkmP [a^^a„] I - |r„p [a^w] 



(95) 



T . 



> kni [kmini - {K - l)yU|rma: 

From (95), 3fJ[r„a^y] > for n G X if (90) holds and 6„ = 1. Similarly, we can show for 6„ = —1, 
under event Q, if (90) holds, 3ft[r„a^y] < 0. Hence if (90) holds we obtain (91). 
Recall that hq is the index of the largest gain: Ir^^l = Irmaxl- Due to (94), we have 



(96) 



We will show that under the event Q, once (92) holds, then sgn(r„^3?[a^^y]) 7^ 6„j leads to a contra- 
diction to (96). First assume 6„j = 1. If 1)^ = sgn(r„j9fj[a^^y]) 7^ b^, then 



bn^ = sgn j r^, + ^ bmrmTm^ K^m] + rn,^ Kw] ) = -1- 

\ m^ni / 

So the expression inside the sgn operator of (97) must be negative. Since r^^ > 0, we must have 

bmrmTm'^ Ka^] + ^w] < 0. 
Multiplying the left-hand- side of (96) by and using the equality |x| ■ \y\ = \xy\, we obtain 



(97) 



(98) 



r„J|3fJ[aJy]| = K| 



^ 6mr„3ft[a^Jam] + 3fJ[afjW] 



(99) 



Due to (97), the last line of (99) inside the | • | operator is negative. Using the fact that r^^ > and 
(98), and the identity |a; + y| = — (x + y) = \y\ — x when x + y < and y < 0, under the event Q, we 
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obtain that 

\rnAm<y]\ 

< |r„J(fr - l)/i|rmax| + kmk - kmlkminl = \rnA[{K - l)Ai|rmax| + r- |r„ 



^ &mrnir^3? [aja„] +r„^3f? [a^^w] 



ni 



(100) 



On the other hand, multiply the right-hand- side of (96) by |r„J. Similarly, using the equality |x| ■ \y\ 
\xy\ and triangle inequality, under the event Q, we obtain 



^ni '"no 



m^no 



\rn,mKy]\ 

> kni|[kmax| - {K - l)/i|rmax| " t] . 



(101) 



Combining (100) and (101), we have that once (92) holds, if 6„j = 1, then sgn(r„^3ft[a^^y]) = —1 leads 
to |3?[a,^^y]| < |3?[a^,y]|, which contradicts (96), and hence sgn(r„^3ft[a,^^y]) = 1. A similar argument 
can be made for hn^ = —1, which completes the proof. 

■ 

We are now ready to prove Theorem 1. The proof for the RDD detector is obtained by combining 
Lemmas 3, 4 and 5. Lemma 3 ensures that the event Q occurs with probability at least as high as 
one minus (45). Whenever Q occurs. Lemma 4 guarantees by using (22), that the RDD detector can 
correctly detect active users under the condition (44), i.e. Q C {X = X}. Finally, whenever Q occurs. 
Lemma 5 guarantees that, based on the correct support of active users, their transmitted symbols can 
be detected correctly under the condition (90), i.e. Q C {6„ = bn,n E I}. Clearly condition (90) is 
weaker than (44), since (44) can be written as |rmin| — {K — l)/i|rinax| > r + (r + Kfi\r^s.K\) > t, and 
hence if (44) holds then (90) also holds. In summary, under condition (44), ^ C {X = X} fl {b = b}, 
and 1 — Pe = -P({X = X} n {b = b}) > P{G), which is greater than one minus (45), which concludes 
the proof for the RDD detector. 

We now prove the performance guarantee for the RDDF detector adopting the technique used in 
proving Theorem 4 in [65]. First we show that whenever Q occurs, the RDDF detector correctly detects 
an active user in the first iteration, which follows from Lemmas 3 and 4. Note that (46) implies (83), 
and therefore, by Lemma 4, we have that by choosing the largest |3?[a^y]|, the RDDF detector can 
detect a correct user in the set X. Second, we show that whenever Q occurs, the RDDF detector 
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correctly detects the transmitted symbol of this active user. Note that (46) also implies (92), since (46) 
can be written as |rmin| > 2r/[l — {2K — l)/i], which implies |rmax| > 2r/[l — (2K — and 
hence Ir^axl + kmini - '2{K - l)/i|rmax| > 2r[l - 2{K - l)/i]/[l - {2K - l)/i] + \r^i^\ > 2t, since 
[1 — 2(K — l)/i]/[l — {2K — > 1. Therefore, by Lemma 5, using a sign detector, we can detect 
the symbol correctly. Consequently, the first step of the RDDF detector correctly detect the active user 
and its symbol, i.e. Q C {X*^^-' C I,bni = ^m}- 

The proof now continues by induction. Suppose we are currently in the A;th iteration of the RDDF 
detector, 1 < k < K, and assume that correct users and their symbols have been detected in all the 
k — 1 previous steps. The A;th step is to detect the user with the largest |3?[a^v'^^^]|. Using the same 
notations as those in Section III-B2 and by definition of v*^'^^ we have 



AR(b-b('=-^))+w = ARx('=-i)+w, (102) 



where x'^'^^^^ = b — b'-'^^^^ This vector has support X/X^^^^^ and has at most K — k + 1 non-zero 
elements, since b*^'^^^-' contains correct symbols at the correct locations for {k — 1) active users, i.e. 
bi!''^^ = bn, for n G X*^'^^^^ This v^^^ is a noisy measurement of the vector ARx^'^^^^ The data model in 
(102) for the kth iteration is identical to the data model in the first iteration with b replaced by x*^*^^^^ 
(with a smaller sparsity K — k + 1 rather than K), X replaced by X/X'^^^^\ and y replaced by v^'^^ 
Since 

|rj^]^| = max |r„| > |rmm|, (103) 

we have that under condition (46) this model (102) also satisfies the requirement (83). Consequently, 
by Lemma 4, we have that under the event Q, 

max |3ft[aV^-^)]| > max |3fJ[aV'=-i)]|. (104) 

Therefore, in the kth iteration, the RDDF detector can detect an active user correctly, i.e. Q C {X'^^^ C X}, 
and no index of active users that has been detected before will be chosen again. On the other hand, since 
(46) can be written as |rmin| > 2r/[l - (27^- l)/i], from (103) this implies |r^lx| > 2r/[l - (2/^- l)/i], 
and hence |r^^ix| - {2K - l)yu|r^^lx| > 2r, and consequently |r|nL| - {2K - 2)/i|ri^ix| + |rmm| > 2r. 
Consequently, condition (92) is true for (102). Then by Lemma 5, we have that under the event Q, 

sgii(3?Ka^j('^-i)]) = 6.,, (105) 
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i.e. Q C {6nJ = bn^}. By induction, since no active users will be detected twice, it follows that the first 
K steps of the RDDF detector can detect aU active users and their symbols, i.e. 

g C Uti{i^'^ C X,6it^ = 6„J = {XW = X,6f ) = 6„,n e X^^)}. (106) 

Note that condition (83) is weaker than (46), since (46) can be written as |rinin|[l — {2K — > 2r, 
which implies |rmax|[l — {'^K — > 2r. This further implies |rinax|[l — '2{K — + \r^in\ > 2r, 
since 1 — 2{K — > 1 — {2K — l)/i and Irmml > 0. Consequently, under condition (46), from (106), 
^ C {X = X} n {b = b}, and 1 - = P{{T = X} n {b = b}) > P{g) which is greater than one 
minus (45), which concludes the proof for the RDDF detector. This completes the proof of Theorem 1. 

References 

[1] A. Duel-Hallen, J. Holtzman, and Z. Zvonar, "Multiuser detection for CDMA systems," IEEE Personal Communications, pp. 46-58, 
April 1995. 

[2] S. Moshavi, "Multi-user detection for DS-CDMA communications," IEEE Comm. Mag., vol. 34, pp. 124 - 136, Oct. 1996. 
[3] S. Verdu, Multiuser Detection. Cambridge University Press, 1998. 

[4] J. G. Andrews, "Interference cancellation for cellular systems: A contemporary overview," IEEE Wireless Comm. Mag.: on advanced 

receiver for CDMA, 2005. 
[5] C. Schlegal and A. Grant, Coordinated multiuser communications. Springer, May 2006. 

[6] M. L. Honig, ed.. Advances in multiuser detection. Wiley Series in Telecommunications and Signal Processing, Wiley-IEEE Press, 
Aug. 2009. 

[7] A. J. Viterbi, CDMA: principles of spread spectrum communication. Addison- Wesley, Apr. 1995. 

[8] R. Lupas and S. Verdu, "Linear multiuser detectors for synchronous code-division multiple-access channel," IEEE Trans. Info. 

Theory, vol. 35, pp. 123 - 136, Jan 1989. 
[9] S. Verdu, "Minimum probability of error for asynchronous Gaussian multiple-access channels," IEEE Trans. Info. Theory, vol. 32, 

pp. 85 - 96, Jan. 1986. 

[10] R. Lupas and S. Verdu, "Near-far resistance of multiuser detectors in asynchronous channels," IEEE Trans. Comm., vol. 38, pp. 496 

- 508, April 1990. 

[11] S. Verdu and S. Shamai (Shitz), "Spectral efficiency of CDMA with random spreading," IEEE Trans. Info. Theory, vol. 45, pp. 622 

- 640, March 1999. 

[12] M. Rupf and J. L. Massey, "Optimal sequence multisets for synchronous code-division multiple-access channels," IEEE Trans. Info. 

Theory, vol. 40, pp. 1261 - 1266, July 1994. 
[13] P. Viswanath and V. Anantharam, "Optimal sequences and sum capacity of synchronous CDMA systems," IEEE Trans. Info. Theory, 

vol. 45, pp. 1984 - 1991, Sept. 1999. 
[14] P. Viswanath and V. Anantharam, "Optimal sequences for CDMA under colored noise: a Schur-saddle function property," IEEE 

Trans. Info. Theory, vol. 48, pp. 1295 - 1318, June 2002. 
[15] S. Ulukus and R. D. Yates, "Iterative construction of optimum signature sequence sets in synchronous CDMA systems," IEEE Trans. 

Info. Theory, vol. 47, pp. 1989 - 1998, July 2001. 



50 



[16] J. A. Tropp, I. S. Dhillon, and R. W. Heath, "Finite-step algorithms for constructing optimal CDMA signature sequences," IEEE 

Trans. Info. Theory, vol. 50, pp. 2916 - 2921, Nov. 2004. 
[17] R. W. Heath, T. Strohmer, and A. J. Paukaj, "On quasi-orthogonal signatures for CDMA systems," IEEE Trans. Info. Theory, vol. 52, 

pp. 1217 - 1226, Mar. 2006. 

[18] J. Luo, S. Ulukus, and A. Ephremides, "Optimal sequence and sum capacity of symbol asynchronous CDMA systems," IEEE Trans. 

Info. Theory, vol. 51, pp. 2760 - 2769, Aug. 2005. 
[19] T. Guess, "CDMA with power control and sequence design: the capacity region with and without multidimensional signaling," IEEE 

Trans. Info. Theory, vol. 50, pp. 2604 - 2619, Nov. 2004. 
[20] A. M. Tulino and S. Verdu, Random matrix theory and wireless communications, ch. 3.1. Now Publishers Inc, June 2004. 
[21] S. Buzzi, H. V. Poor, and A. Zappone, "Transmitter waveform and widely linear receiver design: Noncooperative games for wireless 

multiple-access networks," IEEE Trans. Info. Theory, vol. 56, pp. 4874 - 4892, Oct. 2010. 
[22] M. Simon, J. Omura, R. Scholtz, B. Levitt, J. K. Omura, R. A. Scholtz, and M. K. Simon, Spread spectrum communications 

handbook. McGraw-Hill Professional, Sept. 2001. 
[23] M. K. Varanasi and B. Aazhang, "Near-optimal detection in synchronous code-division multiple-access systems," IEEE Trans. Comm., 

vol. 39, pp. 725 - 736, May 1991. 
[24] M. K. Varanasi and B. Aazhang, "Multistage detection in asynchronous code division multiple access communications," IEEE Trans. 

Comm., vol. 38, pp. 509 - 519, April 1990. 
[25] M. K. Varanasi, "Decision feedback multiuser detection: A systematic approach," IEEE Trans. Info. Theory, vol. 45, pp. 219 - 240, 

Jan. 1999. 

[26] A. J. Viterbi, "Very low rate convolutional codes for maximum theoretical performance of spread-spectrum multiple-access channels," 

IEEE Journal On Selected Areas in Comm., vol. 8, pp. 641 - 649, May 1990. 
[27] A. Duel-Hallen, "Decorrelating decision feedback multiuser detector for synchronous code-division multiple-access channel," IEEE 

Trans. Comm., vol. 41, pp. 285 - 290, Feb. 1993. 
[28] T. Cover, "Some advances in broadcoast channels," m Advances in Communication Systems, New York, NY, USA: Academic, 1975. 
[29] S. Verdu, "Near-far resistant receivers for DS/SSMA conmiunications," tech. rep., U.S. Army Research Proposal, Contract DAAL03- 

87-K-0062, Princeton University, 1986. 
[30] Z. Xie, R. T. Short, and C. K. Rushforth, "A family of suboptimal detectors for coherent multiuser communications," IEEE Journal 

On Selected Areas in Comm., vol. 8, pp. 683 - 690, May 1990. 
[31] J. Yang and S. Roy, "On joint transmitter and receiver optimization for multiple-input-multiple-output (MEMO) transmission systems," 

IEEE Trans. Comm., vol. 42, pp. 3221 - 3231, Dec. 1994. 
[32] H. V. Poor and S. Verdu, "Probability of error in MMSE multiuser detector," IEEE Trans. Info. Theory, vol. 43, pp. 858 - 871, May 

1997. 

[33] M. Mishali and Y. C. Eldar, "Reduce and boost: Recovering arbitrary sets of jointly sparse wideband analog signals," IEEE Set. 

Topics Signal Process., vol. 56, pp. 4692 - 4702, Oct. 2008. 
[34] Y. C. Eldar and M. Mishali, "Robust recovery of signals from a structured union of subspaces," IEEE Trans. Info. Theory, vol. 55, 

pp. 5302 - 5316, Nov 2009. 

[35] Y. C. Eldar, "Compressed sensing of analog signals in shift-invariant spaces," IEEE Trans. Signal Process., vol. 57, pp. 2986-2997, 
August 2009. 

[36] M. Mishali and Y. C. Eldar, "Blind multiband signal reconstruction: Compressed sensing for analog signals," IEEE Trans. Signal 
Process., vol. 57, pp. 993 - 1009, Mar. 2009. 



51 



[37] Y. C. Eldar, "Uncertainty relations for shift-invariant analog signals," IEEE Trans. Info. Theory, vol. 55, pp. 5742 - 5757, Dec. 2009. 
[38] M. Mishali and Y. C. Eldar, "From theory to practice: Sub-Nyquist sampling of sparse wideband analog signals," IEEE Journal of 

Selected Topics in Signal Process., vol. 4, pp. 375 - 391, April 2010. 
[39] K. Gedalyahu and Y. C. Eldar, "Time-delay estimation from low-rate samples: A union of subspaces approach," IEEE Trans, on 

Signal Process., vol. 58, pp. 3017 - 3031, June 2010. 
[40] M. R Duarte and Y. C. Eldar, "Structured compressed sensing: from theory to applications," submitted to IEEE Trans. Signal Process. 

and arXived, 2011. 

[41] R. Gribonval, B. Mailhe, H. Rauhut, K. Schnass, and R Vandergheynst, "Average case analysis of multichannel thresholding," in 

Proc. IEEE ICASSP07, (Honolulu), 2007. 
[42] T. Blumensath and M. E. Davies, "Iterative hard thresholding for compressed sensing," Appl. Comput. Harmon. Anal, vol. 27, 

pp. 265 - 274, 2009. 

[43] Y. C. Rati, R. Rezaiifar, and P. S. Krishnaprasad, "Orthogonal matching pursuit: Recursive function approximation with applications 
with wavelet decomposition," in Proc. 27th Asilomar Conf. Signals, Systems, pp. 40 - 44, Nov. 1993. 

[44] J. Tropp, "Greed is good: Algorithmic results for sparse approximation," IEEE Trans. Inf. Theory, vol. 50, pp. 2231 - 2242, Oct. 
2004. 

[45] W.-C. Wu and K.-C. Chen, "Identification of active users in synchronous CDMA multiuser detection," IEEE Journal On Selected 

Areas in Comm., vol. 16, pp. 1723 - 1735, Dec. 1998. 
[46] T. Oskiper and H. V. Poor, "Online activity detection in a multiuser environment using a matrix CUSUM algorithm," IEEE Trans. 

Info. Theory, vol. 48, pp. 477 - 493, Feb. 2002. 
[47] E. BigUeri and M. Lops, "Multiuser detection in dynamic environment - Part I: user identification and data detection," IEEE Trans. 

Info. Theory, vol. 53, pp. 3158 - 3170, Sept. 2007. 
[48] D. Angelosante, E. Biglieri, and M. Lops, "Multiuser detection in dynamic environment - Part 11: joint user identification and 

parameter estimation," IEEE Trans. Info. Theory, vol. 55, pp. 2365 - 2374, May 2009. 
[49] D. Angelosante, E. Biglieri, and M. Lops, "Low-complexity receivers for multiuser detection with an unknown number of active 

users," Signal Processing, vol. 90, pp. 1486 - 1495, May 2010. 
[50] A. T. Campo, A. G. Fabregas, and E. Biglieri, "Large-system analysis of multiuser detection with an unknown number of users: A 

high-SNR approach," IEEE Trans. Info. Theory, vol. 57, pp. 3416 - 3428, June 2011. 
[51] G. Ricci and M. K. Varanasi, "Blind multiuser detection of L out of K active users of an N user synchronous CDMA system," in 

IEEE 49th Vehicular Tech. Conf, pp. 1445 - 1449, May 1999. 
[52] M. K. Varanasi, "Group detection for synchronous Gaussian code-division multiple-access channels," IEEE Trans. Info. Theory, 

vol. 41, pp. 1083 - 1096, July 1995. 
[53] E. J. Candes and T. Tao, "Near-optimal signal recovery from random projections: Universal encoding strategies?," IEEE Trans. Info. 

Theory, vol. 52, pp. 5406 - 5424, Dec. 2006. 
[54] D. L. Donoho, "Compressed sensing," IEEE Trans. Info. Theory, vol. 52, pp. 1289 - 1306, April 2006. 

[55] A. K. Fletcher, S. Rangan, and V. K. Goyal, "On-off random access channels: A compressed sensing framework," submitted to IEEE 

Trans. Info. Theory and arXived., March 2010. 
[56] A. K. Fletcher, S. Rangan, and V. K. Goyal, "Necessary and sufficient conditions on sparsity pattern recovery," IEEE Trans. Info. 

Theory, vol. 55, pp. 5758 - 5772, Jan. 2009. 
[57] Y. Jin, Y.-H. Kim, and B. D. Rao, "Support recovery of sparse signals," submitted to IEEE Trans. Info. Theory and arXived, March 

2010. 



52 



[58] J. Haupt and R. Nowak, "Signal reconstruction from noisy random projections," IEEE Trans. Info. Theory, vol. 52, pp. 4036 - 4080, 
Sept. 2006. 

[59] J. Haupt and R. Nowak, "Compressive sampling for signal detection," IEEE International Conference on Acoustics Speech and 

Signal Processing, vol. 3, pp. 1509-1512, 2007. 
[60] H. Zhu and G. B. Giannakis, "Exploiting sparse user activity in multiuser detection," IEEE Trans, on Comm., vol. 59, pp. 454 - 

465, Feb. 2011. 

[61] L. Applebaum, W. Bajwa, M. F. Duarte, and R. Calderbank, "Multiuser detection in asynchronous on-off random access channels 
using lasso," in Proc. 48th Annu. Allerton Conf. Comm., Control, and Computing, (Monticello, IL), Sept. 2010. 

[62] L. Applebaum, W. Bajwa, M. Duarte, and R. Calderbank, "Asynchronous code-division random access using convex optimization," 
Submitted for journal publication and arXived, Jan. 2011. 

[63] J. Haupt and R. Nowak, "A generalized restricted isometry property," tech. rep., Univ. of Wisconsin, Department of ECE, 2007. 

[64] D. Guo and C.-C. Wang, "Multiuser detection of sparsely spread CDMA," IEEE Journal On Selected Areas in Comm., vol. 26, 
pp. 421 - 431, April 2008. 

[65] Z. Ben-Haim, Y. C. Eldar, and M. Elad, "Coherence-based performance guarantees for estimating a sparse vector under random 

noise," IEEE Trans. Signal Process., vol. 58, pp. 5030 - 5043, Oct. 2010. 
[66] S. Verdu, "Computational complexity of optimum multiuser detection," Algorithmica, vol. 4, pp. 303 - 312, 1989. 
[67] T. Cai, L. Wang, and G. Xu, "Stable recovery of sparse signals and an oracle inequality," Tech. Rep. [Online]. Available: http://www- 

stat.wharton.upenn.edu/ tcai/paper/Stable-Recovery-MIRpdf, U. Penn., 2009. 
[68] G. Reeves and M. Gastpar, "A note on optimal support recovery in compressed sensing," in Forty-Third Asilomar Conf. on Sig., Sys, 

and Computers, pp. 1576 - 1580, Nov. 2009. 
[69] G. Tang and A. Nehorai, "Performance analysis for sparse support recovery," IEEE Trans. Info. Theory, vol. 56, pp. 1383 - 1399, 

Mar. 2010. 

[70] Y. C. Eldar, P. Kuppinger, and H. Bolcskei, "Block-sparse signals: Uncertainty relations and efficient recovery," IEEE Trans, on 

Signal Process., vol. 58, pp. 3042 - 3054, June 2010. 
[71] R. Lupas and S. Verdu, "Asymptotic efficiency of linear multiuser detectors," in Proc. of 25th Conf. on Decision and Control (CDC), 

pp. 2094 - 2100, Dec. 1986. 

[72] M. Fomasier and H. Rauhut, "Compressive sensing," in Handbook of Mathematical Methods in Imaging (O. Scherzer, ed.), ch. 2, 
Springer, 2011. 

[73] W. Hoeffding, "Probability inequalities for sums of bounded random variables," J. of Amen Stat. Asso., vol. 58, pp. 13 - 30, March 
1963. 

[74] R. Calderbank, S. Jafarpour, and J. Kent, Finding needles in compressed haystacks, ch. 10. Compressed Sensing: Theory and 

Applications, Cambridge University Press, Apr. 2012. 
[75] Z. Sidak, "Rectangular confidence regions for the means of multivariate normal distributions," J. ofAmer. Stat. Asso., vol. 62, pp. 626 

- 633, Jun. 1967. 



